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Abstract 

We  analyze  a  sequential  bargaining  model,  where  players  are  allowed  to  hold  different  beliefs 
about  which  player  will  make  an  offer  and  when.  Excessive  optimism  about  making  offers  in 
the  future  can  cause  a  delay  in  agreement.  Despite  this,  the  main  result  states  that,  if  players 
will  remain  sufficiently  optimistic  for  a  sufficiently  long  future,  then  in  equilibrium  they  will 
agree  immediately. 


1      Introduction 

Considering  a  strict  procedure  that  determines  which  player  will  make  an  offer  and  when,  Stahl 
(1972)  and  Rubinstein  (1982)  show  that,  when  the  delay  is  costly,  in  equilibrium,  two  players 
will  reach  an  agreement  immediately.  Nevertheless,  when  there  are  no  such  strict  procedures, 
players  may  hold  any  beliefs  about  the  negotiation  process,  thereby  holding  any  beliefs  about 
what  each  player  will  get  in  case  of  a  delay.  In  particular,  each  player  may  be  so  optimistic 
about  what  he  will  get  in  case  of  a  delay  that  they  may  not  reach  an  agreement  at  the  beginning. 

In  this  paper,  we  will  analyze  the  problem  of  reaching  an  agreement  in  a  model  that 
extends  the  Rubinstein-Stahl  framework,  where  we  will  allow  players  to  hold  their  own  (possibly 
optimistic)  beliefs  about  the  recognition  process  determining  which  player  will  make  an  offer 
when.  We  take  the  lack  of  a  common  prior  as  the  only  source  of  the  differences  in  beliefs.' 

In  our  model,  the  recognition  process  is  the  only  source  of  bargaining  power.  In  equilibrium, 
the  recognized  player  at  a  given  date  extracts  a  non-informational  rent,  as  he  makes  an  offer 
that  can  be  rejected  only  by  destroying  some  of  the  pie.  These  rents  constitute  the  bargaining 
power:  a  player's  continuation  value  is  the  present  value  of  the  rents  he  expects  to  extract 
when  he  is  recognized  in  the  future.  Therefore,  our  analysis  will  help  us  to  explore  the  broader 
question  of  when  two  rational  individuals  can  reach  an  agreement  even  if  they  hold  incompatible 
beliefs  about  which  parties  will  bring  which  advantages  to  the  table  if  they  keep  bargaining. 

In  our  model,  we  have  optimism  about  a  date  t  whenever  each  player  thinks  that  the 
probability  that  he  will  be  recognized  (and  hence  will  make  an  offer)  at  t  is  higher  than  what 
the  other  player  assesses.  We  measure  the  level  of  optimism  about  t  hy  yt  =  pj  +  p^  —  1,  where 
pI  is  the  probability  player  i  assigns  to  the  event  that  i  is  recognized  at  t. 

Excessive  optimism  can  cause  a  delay.  To  see  this,  consider  the  case  where  two  risk-neutral 
players  are  trying  to  divide  a  dollar,  which  is  worth  1  at  t  —  0,  8  E  (1/2, 1)  at  i  =  1,  and  zero 
afterwards.  It  is  also  common  knowledge  that  each  player  beheves  with  probability  1  that  he 
will  be  recognized  (and  hence  will  make  an  offer)  at  i  =  1  independent  of  recognition  at  t  =  0. 
Since  the  dollar  is  worth  zero  afterwards,  at  i  =  1,  each  player  is  willing  to  accept  any  division, 
hence  the  recognized  player  takes  the  whole  dollar.  Anticipating  this,  at  t  =  0,  each  player 
expects  to  take  the  whole  dollar  next  day,  which  is  worth  6.  Thus,  they  can  agree  on  a  division 
at  f  =  0  only  if  each  gets  at  least  6,  requiring  a  minimum  total  amount  of  26  >  1.  Since  they 
have  only  one  dollar,  they  cannot  reach  an  agreement  at  i  =  0.  Therefore,  in  equilibrium,  it  is 
common  knowledge  at  the  beginning  that  players  will  not  reach  an  agreement  before  i  =  1. 


'If  players'  beliefs  are  common  knowledge  and  the  common-prior  assumption  holds,  then  they  must  hold  the 
same  beUefs,  as  shown  by  Aumann  (1976).  Hence,  the  differences  between  players'  behefs  may  be  caused  either 
by  lack  of  common  knowledge  (as  in  Myerson  (1979)  and  Kennan  and  Wilson,  1993),  or  by  lack  of  a  common 
prior  (as  in  Hicks  (1932),  Landes  (1971),  Gould  (1973),  Posner  (1974),  and  Farber  and  Bazerman,  1989).  For 
plausibility  of  the  common  prior  assumption,  see  Kreps  (1990),  Aumann  (1998)  and  Gul  (1998). 


In  contrast  with  this  Example,  our  main  result  is  an  Agreement  Theorem.  It  states  that, 
if  the  players  will  remain  sufficiently  optimistic  for  a  sufficiently  long  while,  in  equilibrium, 
they  will  reach  an  agreement  immediately.  That  is,  the  excessive  optimism  alone  cannot  be  a 
reason  for  a  delay  in  agreement.  The  reason  is  simple:  A  player  who  is  optimistic  about  the 
future  will  not  settle  for  a  small  share,  even  if  his  opponent  beheves  that  he  would  get  even 
less  if  they  wait.  Hence,  when  his  opponent  recognizes  that  he  will  remain  very  optimistic  for 
a  very  long  while,  she  will  lower  her  expectations.  In  equilibrium,  their  expectations  will  be  so 
low  that  they  will  agree  immediately. 

These  two  seemingly  conflicting  results  share  a  common  intuition.  As  in  our  Example, 
players  will  delay  the  agreement  if  they  are  very  optimistic  about  getting  a  very  high  rent  in 
the  near  future.  But  the  size  of  this  rent  depends  upon  players'  expectations  in  the  future. 
Particularly,  if  their  expectations  about  the  rent  at  t  +  1  are  high,  the  rent  at  t  will  be  small.  If 
the  players  will  remain  sufficiently  optimistic  for  a  sufficiently  long  while,  in  equilibrium,  the 
rents  in  the  near  future  will  be  so  small  that  each  player  will  prefer  to  agree  on  his  opponents 
terms  rather  than  waiting  and  getting  these  rents.  Hence  our  Agreement  Theorem. 

As  our  Example  indicates,  it  is  crucial  for  our  Agreement  Theorem  that  the  optimism  stays 
high  for  a  long  while.  Generahzing  this  example,  in  Subsection  3.4  we  show  that,  if  players 
are  excessively  optimistic  for  some  t*  in  the  near  future,  while  they  are  excessively  pessimistic 
for  t*  +  1  (i.e.,  ii  yt'  is  very  high  while  yt'+i  is  very  low),  then  the  agreement  may  be  delayed 
until  t*.  Here,  t*  need  not  be  very  small:  the  delay  can  be  so  long  that  almost  half  the  gains 
from  trade  is  lost.^ 

In  order  to  cause  such  a  long  delay,  the  change  in  the  level  of  optimism  must  also  be 
abrupt.  For  the  special  case  of  transferrable  utility  and  independently-distributed  recognition 
process.  Theorem  2  states  that  they  will  reach  an  agreement  immediately,  so  long  as  the  level  of 
optimism  does  not  drop  substantially  at  any  given  date  (i.e.,  so  long  as  yt  —  yt+i  is  sufficiently 
low  at  each  t) ,  relaxing  the  hypothesis  of  our  Agreement  Theorem  in  this  case. 

Theorem  2  identifies  the  assumption  behind  the  immediate  agreement  result  in  the  Rubinstein- 
Stahl  framework.    It  is  that  the  level  of  optimism  stays  constant  -  not  necessarily  that  the 
optimism  is  absent. 

These  results  also  shed  some  light  on  the  deadline  effect  observed  in  some  bargaining 
experiments  (Roth  et  al,  1988)  and  in  some  real-world  negotiations,  where  players  commonly 
settle  just  before  the  deadline.  Above,  t*  may  be  a  deadline,  after  which  players  do  not  hope 
to  make  any  offer.  Hence,  if  the  players  are  very  optimistic  about  making  the  final  offer  at 
t*,  by  Subsection  3.4,  players  will  settle  just  before  the  deadline,  as  it  is  observed.  On  the 
other  hand.  Theorem  2  suggests  a  further  test  for  our  theory:  when  the  deadhne  is  random 


This  is  the  theoretical  bound  of  the  length  of  deterministic  delays.  When  people  learn  from  the  past,  the 
agreement  depends  upon  the  history,  in  which  case  the  length  of  the  delay  can  be  unbounded  (Example  1). 


and  players  can  make  offers  very  fast,  the  deadline  effect  must  disappear. 

By  requiring  the  level  of  optimism  to  stay  high  for  a  long  while,  our  Agreement  Theorem 
implicitly  requires  that  players'  beliefs  are  sufficiently  firm.  That  is,  they  do  not  update  their 
beliefs  drastically  as  they  observe  who  gets  a  chance  to  make  an  offer  and  when.  (We  will  call 
such  change  of  beliefs  "learning.")  If  a  player  i  is  known  to  update  his  beliefs  fast,  the  other 
player,  say  j,  may  believe  that,  if  they  disagree  now,  in  a  short  while  i  will  realize  that  he  is 
wrong,  and  thus  be  willing  to  agree  on  j's  terms.  In  that  case,  j's  expectation  about  the  future 
will  be  higher  than  what  it  would  be  if  z's  behefs  were  very  firm.  But  i  does  not  find  it  very 
hkely  that  he  will  change  his  mind;  he  might  even  expect  that  j  will  change  her  mind.  This 
may  cause  a  disagreement. 

Furthermore,  in  a  typical  learning  process,  agents  learn  faster  at  the  beginning,  in  which 
case  the  disagreement  will  be  at  the  beginning  of  the  game,  and  hence  we  will  observe  a  delay  in 
agreement  on  the  path  of  equilibrium.  Considering  a  canonical  model  of  learning  and  confining 
ourself  to  the  transferrable-utility  case,  we  show  for  the  infinite-horizon  case  that  there  exists 
some  date  t**  such  that  players  will  never  reach  an  agreement  before  t**,  and  they  will  always 
reach  an  agreement  thereafter.  We  further  demonstrate  that,  when  players'  initial  beliefs  are 
not  firm,  t**  can  be  large,  generating  long  delays  in  equihbrium.  It  is  common  knowledge  at 
the  beginning  of  the  game  that  they  will  disagree  until  t**,  distinguishing  our  theorem  from 
usual  delay  results  in  bargaining  with  private  information. 

The  outline  of  the  paper  is  as  follows.  In  the  next  section,  we  lay  out  or  model.  In  Section  3, 
we  analyze  the  special  case  of  transferrable  utility  and  independently-distributed  recognition 
process.  There,  we  use  only  simple  algebra  to  derive  our  main  results.  We  describe  the 
equilibria  in  Section  4,  and  present  our  Agreement  Theorem  in  Section  5.  Section  6  contains 
the  analysis  of  our  canonical  model  of  learning.  After  discussing  the  impact  of  risk-aversion 
on  reaching  an  agreement  in  Section  7,  we  conclude  in  Section  8. 

2     Model 

In  this  section  we  will  lay  out  our  model.   We  will  write  K*^  for  any  /c-dimensional  Euclidean 
space,  N  for  the  set  of  non-negative  integers. 

We  take  a  grid  T  =  {t  e  N\t  <t}  tohe  our  time  space  for  some  i  <  oo,  a  set  A'^  =  {1, 2}  to 
be  our  set  of  players,  and  a  compact  and  convex  set  U  cM?  to  be  the  set  of  all  feasible  expected 
utihty  pairs.  Throughout  the  paper,  we  assume  that  U  contains  0  and  at  least  another  member 
that  is  strictly  greater  than  0.  We  will  further  assume  the  following  the  following  regularity 
condition. 

Assumption  FDA     Given  any  (u^,u^)  G  U,  any  distinct  i,j  €  A'^,  and  any  v'  G  [0,tx^),  there 
exists  some  v^  >  u^  such  that  (w^u^)  G  U. 


That  is,  the  frontier  of  U  is  decreasing  on  the  non-negative  quadrant.  This  condition  is 
satisfied  under  the  free-disposal  assumption  with  locally  insatiable  preferences. 

We  will  analyze  the  following  game,  denoted  by  G^[5,p].  At  each  t  eT,  Nature  recognizes 
a  player  i  £  N;  i  offers  an  alternative  u  =  (i/\u^)  G  U;  if  the  other  player  accepts  the  offer, 
then  the  game  ends  yielding  a  payoff  vector  8''u  =  {6^u^,S^u'^)  for  some  6  S  (0, 1);  otherwise, 
the  game  proceeds  to  date  t  +  1,  except  for  t  =  i  —  1,  when  the  game  ends  yielding  payoff 
vector  (0, 0).  We  write  p  =  {pt}t^T  ^°^  ^^^  publicly  observed  stochastic  process  that  recognizes 
the  players  (so  that  they  can  make  offers),  Pj  =  (/Oq(w),Pi(cj),.  . .  ,pj_j(w))  6  A''*  for  a  generic 
history  of  the  recognized  players  at  the  beginning  of  date  t,  and  P^  i'lPs)  for  an  agent  i's 
conditional  beliefs  at  any  history  p^  6  TV*.  Players  recall  everything  happened  in  the  past;  and 
everything  described  in  this  paragraph  is  common  knowledge. 

Notice  that  we  have  two  sets  of  beliefs,  one  for  each  player;  this  is  our  only  departure  from 
the  Rubinstein-Stahl  framework.^ 

Given  any  i  e  N  and  any  history  p^  E  N^,  we  write 

pliPs)  =  P'{Pt  =  APs) 

for  the  probability  player  i  assigns  to  the  event  that  he  is  going  to  be  recognized  at  date  t>s. 
By  a  belief  structure,  we  mean  any  full  list  p  =  {pj  (Ps)}  of  such  probability  assessments. 
We  will  also  sometimes  write  G*  [6,p]  for  the  game  where  the  belief  structure  is  p. 

Take  any  date  t  E  T  and  any  history  p^  G  A''^  with  s  <  t.  Since  we  excluded  the  contingency 
that  no  player  is  recognized,  players'  probability  assessments  agree  at  {t,Ps)  iS  p]  [p^)  + 
Pt  (Ps)  —  1-  Now,  if  p\  (ps)  +  Pf  (ps)  >  1,  then  each  player  thinks  that  the  probabihty  that 
he  is  going  to  be  recognized  at  t  is  higher  than  what  the  other  player  assesses.  As  explained 
in  the  Introduction,  being  recognized  is  not  bad;  so  we  say  that  players  are  optimistic  for  t 
at  Ps  iff  pI  (Ps)  -|-  Pf  (ps)  >  1.  Likewise,  we  say  that  players  are  pessimistic  for  t  at  p^  iff 
pI  (Ps)  +P?  (Ps)  <  1-  We  write 

yt(pJ=Pf  (Ps)+p?(pJ-i 

for  the  level  of  optimism  for  t.  Finally,  common-prior  assumption  in  our  context  corresponds 
to 

yt  =  0         (Vf  e  T) .  (CPA) 

When  CPA  holds,  by  No-trade  Theorem  of  Milgrom  and  Stokey  (1982),  players  have  no 
incentive  to  bet  on  p.   Since  we  are  particularly  interested  in  the  case  where  CPA  fails  (i.e., 


With  independently  distributed  p,  our  model  is  also  isomorphic  to  an  extension  of  the  model  of  Sakovics 
(1993),  originally  developed  by  Perry  and  Reny  (1993),  a  bargaining  model  where  players  endogenously  choose 
when  to  make  an  offer. 


y  7^  0),  our  players  do  have  an  incentive  to  bet  on  p;  and  they  are  precluded  from  doing 
so  merely  by  our  imbedded  assumption  that  the  side-bets  are  not  feasible  (e.g.,  they  are  not 
enforceable).'^ 

Notation  We  will  designate  dates  t,s  £T,  histories  p,  G  A''*,  p^  e  TV*,  players  i,j,k  6  N 
with  i  j^  j,  and  a  belief  structure  p  as  generic  members.  Note  that  p  and  y  are  functions 
defined  on  a  large  space  T  x  {Us^t^^)  and,  for  each  t  Q.T,  p]  and  yt  are  stochastic  processes, 
whose  values  at  any  s  and  Pg  are  pi  (p^)  and  yt  (Ps),  respectively. 

3     Dividing  a  Dollar  —  An  Intuitive  Exposition 

In  this  section  we  confine  ourself  to  the  simple  bargaining  problem  of  dividing  a  dollar,  and 
analyze  the  subgame-perfect  equilibria  of  a  fimte-horizon  game  G^[6,p\  under  the  following 
two  assumptions  of  transferrable  utility  and  no  learning  from  previous  recognitions. 

Assumption  TU     U  =  {u  e  [0,  l]^\u^  +  u'^  <  1} . 

Assumption  NL     p  is  independently  distributed  under  both  probability  distributions  P^  and 

Note  that,  according  to  Assumption  NL,  p  is  deterministic,  i.e.,  pi  (p^)  =  pi  (p'^,)  at  any 
two  histories  p^  and  p'^,  with  s,s'  <  t.  Hence,  pi  G  [0, 1]  and  yt  e  [—1, 1]. 

Under  these  two  assumptions,  we  will  first  construct  the  equilibrium,  and  prove  the  Agree- 
ment Theorem,  stating  that  players  will  reach  an  agreement  immediately,  so  long  as  the  game 
is  sufficiently  long  and  y  >  0.  Then,  allowing  y  to  take  both  positive  and  negative  values,  we 
will  present  an  example  with  a  long  delay,  and  discuss  the  general  properties  of  delay  in  our 
model.  Finally,  we  will  present  another  version  of  Agreement  Theorem  that  requires  only  that 
y  does  not  drop  suddenly. 

3.1      Continuation  values  in  Equilibrium 

For  any  given  finite-horizon  game  G*[5,p],  we  first  construct  a  subgame-perfect  equilibrium; 
any  subgame  perfect  equilibrium  is  payoff  equivalent  to  the  one  we  construct. 

Let  us  write  V^  for  the  continuation  value  of  player  i  at  the  beginning  of  date  t,  and 
St  =  Vf^  -I-  V^  for  the  "perceived  size  of  the  pie."  If  they  have  not  reached  an  agreement  before 


■"Even  if  the  side-bets  on  p  were  feasible,  agents  could  make  a  side-bet  on  Pj  void  by  agreeing  prior  to  t.  On 
the  other  hand,  if  there  exists  another  contractible  process  B  that  remains  publicly  observable  regardless  what 
players  do  and  that  yields  p(  measurable  with  respect  to  a  (Bo.  ■  ■  ■  ,  Bt),  then  we  would  simply  have  a  general 
immediate-agreement  theorem  similar  to  the  one  in  Yildiz  (1998).  For  they  can  simply  emulate  the  side-bets  on 
p  by  some  contracts  on  B. 


t,  each  player  will  get  0;  hence,  we  set 

V^'  =  V^  =  Si  =  0.  (1) 

Taking  this  as  final  values,  we  determine  V  and  5,  recursively.  To  do  this,  let  us  take  any 
t  e  T.  There  are  two  cases  we  need  to  address  separately.  We  first  consider  the  case  SSt+i  >  1. 
If  they  do  not  agree  at  t,  each  i  will  get  V^\i  at  t  +  1,  which  is  equivalent  to  ^V/^j  at  t.  They 
will  then  agree  on  a  utility  pair  u  =  [u^  ,u^^  only  if  u^  >  SV^_^_■^  at  each  i  E  N,  which  requires 
that  u^  +u^  >  5V'(l|_j  +  <5Vj^j  =  6St+i  >  1,  showing  that  such  u  is  not  feasible.  That  is,  they 
will  not  reach  an  agreement  at  t.  Therefore,  the  continuation  value  of  each  i  at  the  beginning 
of  date  t  will  be: 

V;  =  6V^^,         {6St+i>l).  (2) 

Adding  up  both  sides  of  the  equation  over  N,  we  obtain 

St  =  6St+i         {6St+i  >  1) .  (3) 

Note  that  their  beliefs  about  recognition  at  t  is  immaterial  to  the  problem. 

Now  we  consider  the  case  6St+i  <  1.  Any  player  j  accepts  an  offer  iff  it  gives  him  at  least 
SV^_^_■^.  If  the  recognized  player  i  offers  to  give  SV^^-^  to  j,  he  is  left  with  1  —  SVi^^,  which  is  now 
at  least  as  high  as  his  continuation  value  SV^^^.  Thus,  in  equilibrium,  player  i  offers  to  give 
^^+1  *°  j  ^^"^  *°  *'^'^^  ^  ~  ^^+1  ^°  himself;  and  the  offer  is  accepted.  Therefore,  continuation 
value  of  player  i  at  the  beginning  of  date  t  is: 

V;    =    pj(l-'5^4i)  +  (l-pD'5VtVi 

=    pi{l-6St+i)  +  6V^^,         {6St+i<l).  (4) 

That  is,  the  continuation  value  of  player  z  at  i  is  the  present  value  ^V^+j  of  his  continuation 
value  at  i  +  1  plus  the  rent  pl{l  —  SSt+i)  he  expects  to  extract  when  he  makes  an  offer.  By 
adding  equation  (4)  up  for  players,  we  obtain 

St  =  l  +  yt-  SytSt+i        {6St+i  <  !)•  (5) 

Briefly,  the  processes  V^,  V'^,  and  5  defined  by  the  equations  (1-5)  describes  a  subgame- 
perfect  equilibrium.  Furthermore,  V^  and  V"^  are  the  only  continuation  values  consistent  with 
any  subgame-perfect  equilibrium;  and  S  prescribes  whether  they  agree  at  any  subgame-perfect 
equilibrium.  Note  that,  in  equilibrium,  agreement  at  any  date  is  fully  determined  by  the 
aggregate  data  y,  6  (and  5). 


3.2     Agreement  and  Disagreement  Regimes 

Our  equilibrium  has  two  regimes.  The  first  one,  which  we  call  disagreement  regime,  is  charac- 
terized by  inequality  St+i  >  1/5.  In  this  regime,  if  they  have  not  yet  reached  an  agreement, 
players  do  not  reach  an  agreement  at  t,  either.  Hence,  their  beliefs  about  which  player  will  be 
recognized  at  t  is  irrelevant;  and  the  perceived  size  of  the  "pie"  shrinks  geometrically  as  we  go 
back  in  time. 

If  i  is  the  first  date  they  reach  an  agreement  after  a  period  of  disagreement  regimes,^  the 
perceived  size  of  the  pie  at  any  t  in  that  period  will  be 

S^  =  6'-'S:.  (6) 


Hence,  the  length  of  such  a  period  will  be 

log  Si 


L{Si,6) 


(7) 


log(l/<5) 

where  operator  [•]  finds  the  smallest  integer  that  is  greater  than  or  equal  to  the  argument 
Since  S^  can  be  at  most  2,  the  length  of  a  disagreement  period  can  be  at  most 

log  2 


1(6) 


(8) 


log(l/<5) 

Hence,  given  any  6,  the  lengths  of  disagreement  periods  are  bounded  uniformly.  In  efficiency 
terms,  the  length  of  delay  can  be  at  most  as  large  as  almost  half  the  pie  is  lost,  i.e.,  1/2  < 

The  second  regime  is  called  the  agreement  regime,  and  characterized  by  inequality  St+i  < 
1/6.  In  this  regime,  if  they  have  not  reached  an  agreement  yet,  players  immediately  agree  on 
a  division  that  gives  the  recognized  player  (say  i)  I  —  ^^t+i'  which  is  higher  than  6V^_^_.^■,  and 
this  is  what  he  would  get  if  the  other  player  j  were  recognized.  That  is,  the  recognized  player 
extracts  a  rent 

Rt  =  l-  <5V/+i  -  <5V;Vi  =  1  -  6St+i . 

The  discrepancy  among  players'  beliefs  on  which  player  will  get  this  rent  causes  the  dis- 
crepancy between  the  perceived  size  St  of  the  pie  and  its  actual  size  of  1.  In  fact,  we  can 
re-write  equation  (5)  as 

St  =  l  +  yt{l-  SSt+i)  =  1  +  ytRt  '  (9) 

so  that  the  discrepancy  is  5t  —  1  =  ytRt-  Therefore,  a  high  rent  for  the  recognized  player 
aligned  with  excessive  optimism  may  prevent  players  from  reaching  an  agreement  in  some 
previous  dates. 


^  Since  they  agree  at  t  —  1,  there  is  such  a  date  for  each  disagreement  period. 


If  yt  <  0  and  we  have  an  agreement  regime  at  t,  then  by  (9),  we  have  an  agreement  regime 
at  i  -  1,  as  well.  Thus,  iiyt  <0  at  each  t  £T,  then  we  will  have  agreement  regimes  throughout 
the  game.  In  fact,  this  case  can  be  imbedded  in  the  Rubinstein-Stahl  framework  by  recognizing 
each  player  i  with  probability  pi,  and  by  recognizing  nobody  with  probability  1  —  pj  —Pt-  We 
will  now  analyze  the  case  of  y  >  0. 

3.3     Perpetual  Optimism 

Confining  ourselves  to  long  but  finite  games,  in  this  section,  we  will  show  that  players  will 
reach  an  agreement  immediately,  so  long  as  players  are  optimistic  throughout  the  game,  i.e., 

yt>0         Vi  G  T.  (PO) 

We  start  with  the  following  Lemma,  giving  us  an  agreement  regime  at  each  date  s  <  t  whenever 
the  pie  at  <  +  1  is  perceived  to  be  of  the  "right  size." 

Lemma  1  Assume   TU  and  NL.    Given  any  t  with  yt    >   0,    if  St+i   6    [l,l/i5],    then  St   G 
[l,2-,5]c[l,l/.5]. 

Proof.  Assume  that  St+i  G  [1,1/^].  Then,  we  have  an  agreement  regime  at  t,  and  hence 
by  (9),  we  have  St  =  I  +  ytRt,  where  the  rent  Rt  =  I  —  6St+i  €  [0, 1  —  6]  is  bounded  from 
above.  Since  yt  €  [0, 1],  ytRt  G  [0, 1  —  6]  and  therefore  St  =  I  +  ytRt  G  [1,2  —  6].  Note  that 
2  —  6  <  1/6  (with  equality  only  at  5  =  1).    ■ 

Under  perpetual  optimism  (PO),  Lemma  1  gives  us  immediate  agreement  for  sufficiently 
long  games.  To  see  this,  first  note  that,  by  (PO),  yj_i  >  0,  hence  5f_i  =  1  +yi-i  >  1-  When 
j/t_i  <  {1  —  S)  /6,  we  have  5f_i  =  1  +  yi-i  <  1/6  so  that  Si-i  G  [1, 1/^].  In  that  case,  using 
Lemma  1,  we  can  conclude  via  mathematical  induction  that  5^  G  [1,1/5]  at  each  t  <  t  —  1, 
showing  that  we  have  an  agreement  regime  at  each  t  &T. 

Now  we  consider  the  case  when  yi^i  >  {1  —  6)  /6.  In  this  case,  we  have  5f_i  =  1  +  yi-i  > 
1/6,  and  hence  a  disagreement  regime  at  i  —  2.  In  fact,  we  have  already  shown  that  there  is  a 
period  of  disagreement  regimes  with  length  L{St-i,6)  <  L  (6)  ending  at  i  —  2.  Now,  assuming 
that  the  game  is  sufficiently  long,  consider  the  last  date  with  an  agreement  regime  before  i—  2, 
namely  i  =  i  —  2—  L{St-i,6).  By  definition,  we  have  S^_^_i  <  1/6  and  5j_,_2  >  1/6.  The  latter 
inequahty  also  implies  that  5(_,_i  =  6Si_^2  >  1'  i-^-'  Si_^i  G  [1, 1/6].  Once  again,  using  Lemma  1, 
we  can  conclude  via  mathematical  induction  that  St+i  G  [1, 1/6]  at  each  t  <t,  showing  that  we 
have  an  agreement  regime  at  each  t  <t.  To  sum  up,  in  this  case,  for  t  >  L{St^i,6)  +  2,  we  have 
three  periods  of  constant  regimes:  We  have  agreement  regimes  at  {0, . . .  ,i  —2—  L{St-i,6)}, 
disagreement  regimes  at  {i—  L{St-i,6)  —  1, . . .  , i  —  2},  and  again  an  agreement  regime  at  t  —  1. 
(This  is  exhibited  in  Figure  1  for  the  extreme  case  when  yt  =  pj  =  p^  =  1  at  each  t  £T.) 

In  either  case,  we  have  an  agreement  regime  at  each  t  <  i  —  2—  L{6)  <  i  —  2—  L{Si^i,6), 
proving  our  first  Agreement  Theorem: 
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Figure  1;  The  perceived  size  of  the  pie  in  the  extreme  case  where  yt  =  p}  =  pf  =  1  at  each 
t  £T.  [i  =  30,  6  =  O.9.]  Note  that  we  have  an  agreement  regime  at  i  —1,  too. 


Theorem  1  Assume  TU,  NL,  and  PO.  Then,  we  have  an  agreement  regime  at  each  t  £  T 
with  t  <  t-  L{6)  -2. 

Excluding  the  end  of  the  game,  our  Theorem  states  that,  if  players  are  optimistic  throughout 
the  game,  they  will  reach  an  agreement  immediately,  suggesting  that  the  Example  we  have 
given  in  the  Introduction  may  be  rather  singular.  Actually,  our  Example  illustrated  that 
sequential  bargaining  allocates  a  non-negative  rent  for  the  recognized  player  at  any  date  t;  and 
this  rent  can  be  very  large.  If  players  are  also  extremely  optimistic  about  getting  this  large 
rent,  they  may  choose  to  wait  at  t  —  1. 

Our  Theorem  goes  one  step  beyond  this  intuition.  At  such  an  agreement  regime  t,  if  players 
are  also  sufficiently  optimistic  for  subsequent  times,  the  rent  for  the  recognized  player  will  be 
small,  a  fact  that  we  estabhshed  in  Lemma  1.  Our  Theorem  states  that,  in  equilibrium,  this 
rent  will  be  so  small  that  the  players  will  reach  an  agreement  at  i  —  1  even  if  they  are  very 
optimistic  about  getting  it. 

3.3.1      Perpetual  Optimism  —  Infinite  Horizon 

Theorem  1  establishes  that,  in  case  of  perpetual  optimism,  we  have  agreement  regimes  through- 
out the  game  except  for  a  bounded  period  just  before  the  end.  Naturally,  as  i  — >  oo,  this  period 
of  disagreement  regimes  disappears,  and  thus  we  have  agreement  regimes  throughout  the  game 
-  like  in  the  Rubinstein-Stahl  framework. 

Take  any  8  £  (0, 1)  and  any  infinite  behef  structure  p.  For  any  sufficiently  long  truncation 
G*,  write  S[t\  and  i[t\  for  the  size  of  the  pie  and  the  last  date  before  any  disagreement,  respec- 
tively. Given  any  t  <  t[i\,  we  have  an  agreement  regime  at  each  s  with  t  <  s  <  t[i\;  hence, 
solving  (5)  on  this  interval,  we  obtain 

t[t]-i 
St[t]=  5]7rf(l  +  y,)  +  7rf5,-[i], 

where  ttj  is  YVjZ^  {Syj)  when  t  <  s  and  1  when  t  =  s.  As  ?  — >  oo,  i[t\  — >  oo,  hence  tt^  '  —y  0; 
and  since  0  <  S^<n  <  2,  St[t\  converges  to 

oo 

St[<^]=J2nt{l  +  ys).  (10) 

s=t 

By  Theorem  1,  St[t\  e  [1,1/5]  for  every  sufficiently  large  i  so  that  St  [oo]  6  [1,1/5]  at  each 
f  G  N.  That  is,  for  i  =  oo,  we  have  agreement  regimes  throughout:  at  any  t,  the  recognized 
player  i  gives  the  other  player,  say  j,  ^Vj-^^Joo]  G  [0,6],  keeping  1  —  5VJ-!^j[oo]  G  [1  —  8,1]  for 
himself  -  a  behavior  that  replicates  the  equihbrium  behavior  in  the  Rubinstein-Stahl  frame- 
work, characterized  by  CPA.  In  fact,  when  CPA  holds  (i.e.,  y  =  0),  V^'[oo]  can  take  any  value 
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in  [0, 1],  thus  any  such  division  is  consistent  with  CPA;  and  therefore  CPA  (and  thereby  the 
Rubinstein-Stahl  framework)  is  not  refutable  in  the  case  of  perpetual  optimism. 

To  see  this,  let  yi  =  y  6  [0,1]  at  each  i  G  N.  Now,  by  inserting  y  for  each  y^  in  (10), 
one  can  easily  compute  that  St  [oo]  =  j^§^,  and  hence  Rt  [oo]  =  1  -  8St+i  [oo]  =  j^  =  R. 
Moreover,  by  Proposition  2  of  Section  4,  we  have  V}  [oo]  =  Y!^=t  ^^"Vs-^s  [oo]-  Thus,  V^  [oo]  = 
^E^t^'"Vs-  Since  p\  G  [y,  1]  at  each  5  6  T,  the  range  of  (1  -  5)  E^t  ^'"Vs  is  [y,l],  and 
therefore  the  range  of  V^  [oo]  is  jf^-,  yij^  •  When  CPA  holds  (i.e.,  at  y  =  0),  this  interval  is 
entire  [0,1],  thus  any  outcome  is  consistent  with  CPA. 

As  y  increases,  the  interval  shrinks;  and  at  y  =  1  it  consists  of  1/(1  +  6).  That  is,  in 
the  equilibrium  of  the  extreme  case  when  everyone  keeps  believing  that  he  will  make  all  the 
remaining  offers  no  matter  what  happened  so  far  (i.e.,  pj  =  pj  =  y  =  1  Vi  G  T),  the  recognized 
player  offers  to  take  1/(1  +  6),  leaving  the  other  player  6/(1  +  6)\  and  the  offer  is  accepted. 
This  is  the  equilibrium  outcome  under  the  Alternating-Offer  Procedure,  analyzed  by  Rubinstein 
(1982).  Note  that  Rubinstein's  well-known  division  is  the  only  division  that  is  not  ruled  out 
by  any  level  of  optimism. 

3.4     A  Long  Delay  in  Agreement 

Theorem  1  establishes  immediate  agreement  for  the  long  games  where  players  remain  opti- 
mistic. Now,  allowing  players  be  pessimistic  sometimes,  we  will  create  a  simple  example  where 
the  agreement  is  delayed  for  a  long  while,  no  matter  how  long  the  game  is.  The  rationale 
behind  this  example  is  simple.  When  each  player  believes  that  he  will  not  be  able  to  make  an 
offer  after  a  given  date,  they  will  act  as  if  the  negotiation  stops  there,  imitating  a  short  game. 

Since  the  length  of  a  period  of  disagreement  regimes  cannot  be  longer  than  L  [6),  we  already 
have  an  upper  bound  for  the  delay  in  agreement.  Note  that  this  upper  bound  is  rather  long: 
^^(  )+i  <  1/2^  i.e.,  almost  half  the  benefit  is  lost.  Nevertheless,  it  is  not  loose,  as  our  example 
reveals: 

Consider  any  belief  structure  p  with  yt-  =  1  and  yj  =  —  1  at  each  t  >  t* ,  where  t*  is 
any  date  with  0  <  i*  <  min{L  {8)  ,t}.  At  t*  and  thereafter,  thinking  that  he  will  no  longer 
have  any  other  chance  to  make  an  offer,  each  player  is  willing  to  accept  any  offer.  Hence,  the 
recognized  player  can  extract  a  huge  rent:  we  have  iilj  =  1  at  each  t  >  t* .  They  consider  t* 
an  auspicious  moment,  each  thinking  that  he  is  going  to  make  an  offer  at  t*.  Amplified  by  the 
extreme  pessimism  for  the  future,  their  optimism  for  t*  leads  them  to  perceive  the  pie  at  t* 
very  large: 

Sf  =  l+yt'Rf  =2>  1/6. 

The  pie  at  t*  is  perceived  to  be  so  large  that  they  will  never  reach  an  agreement  before  t*: 

St+i  =  6^'-'-^Sr  =26^'-''^  >l/6,         (0  <  i  <  i*) 
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where  the  last  inequahty  is  due  to  the  definition  of  L  [6),  which  is  defined  by  26  ^  '  S  (1, 1/6]. 
That  is,  during  {0, . . .  ,t*  —  1},  without  agreeing,  players  wait  for  the  auspicious  moment  t*, 
when  all  the  offers  are  going  to  be  accepted. 

This  example  illustrates  some  key  properties  of  delays  caused  by  lack  of  a  common  prior. 
Firstly,  the  delay  in  our  model  is  not  Pareto-optimal  in  general.  To  see  this,  consider  the 
case  yi  =  1.  Now,  giving  the  dollar  to  the  player  recognized  at  i  =  1  Pareto- dominates  the 
equilibrium  outcome,  which  gives  the  dollar  to  the  player  recognized  at  t*.  Note  that  a  player's 
expected  utility  levels  under  these  two  schemes  are  6  and  6*  ,  respectively.  [This  also  shows  that 
the  equilibrium  outcome  in  the  example  presented  in  the  Introduction  was  Pareto-optimal.] 

Second,  at  the  beginning  of  our  game  it  is  common  knowledge  that  players  will  not  reach 
an  agreement  before  t*;  whereas  the  delay  is  only  a  possibihty  in  usual  bargaining  models  with 
private  information,  as  the  types  with  the  least  advantageous  information  reach  an  agreement 
immediately  in  any  separating  equilibrium  in  these  models. 

Third,  there  is  a  consensus  among  our  players  that  the  delay  is  costly  and  that  there  is  a 
division  at  t  =  0  that  gives  each  player  more  than  what  he  gets  in  equilibrium.  That  is,  they 
agree  that  the  size  of  the  pie  at  t  =  1  is  only  6;  and  each  player  further  thinks  that  they  both 
would  get  more  than  their  equilibrium  consumption  if  they  divided  the  dollar  at  f  =  0  by 
giving  6  +  (1  —  6)/2  to  himself  and  leaving  (1  —  6)/2  to  his  opponent.  There  is  no  consensus, 
however,  on  which  division  at  i  =  0  dominates  the  equilibrium  outcome. 

Finally,  even  though  there  is  a  consensus  that  the  delay  is  costly,  one  cannot  find  a  mecha- 
nism that  would  yield  an  agreement  at  the  beginning  and  would  be  accepted  by  each  player  to 
replace  the  current  situation.  In  particular,  they  cannot  agree  on  a  procedure  that  recognizes 
them  with  "objective"  probabilities  throughout  the  negotiation.  For,  if  such  a  mechanism  were 
accepted  by  player  1,  it  would  give  him  at  least  5* ,  thus  would  give  player  2  at  most  1  —  6*, 
and  would  be  rejected  by  player  2. 

A  period  of  low  y  does  not  necessarily  mean  that  they  are  unreasonably  pessimistic;  it  may 
simply  reflect  the  fact  that  agents  are  not  going  to  negotiate  during  this  period,  as  happens 
in  bargaining  with  a  deadline  or  several  "bargaining  sessions."  In  that  case,  our  analysis 
illustrates  that,  if  the  end  of  the  session  or  the  deadline  is  sufficiently  close,  agents  will  wait 
imtil  the  deadline  to  settle.  This  deadline  effect  is  observed  in  laboratory  experiments  and  the 
real- world  negotiations. 

3.5      A  sudden  loss  of  optimism 

In  the  previous  section,  we  illustrated  that  a  sudden  loss  of  optimism  may  give  players  an 
incentive  to  wait.  Now  we  will  establish  that,  whenever  there  is  a  period  of  disagreement 
regimes,  it  must  be  followed  by  a  sudden  loss  of  optimism. 

Towards  this  goal,  our  first  lemma  states  that  if  a  disagreement  regime  proceeds  an  agree- 
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ment  regime,  there  must  be  a  substantial  drop  in  y. 

Lemma  2  Given  any  G^[S,p]  and  any  t  <  i  —  I,  if  St  >  ^  and  St+i  <  -j,  then  y^  —  j/^^j  > 
{l-b)/6. 

Proof.    Take  any  t  <  t  —  1  with  St  >  \  and  St+\   <  |.    Since  St+i   <  |,  we  have  an 
agreement  regime  at  t,  and  hence 

St  =  l  +  yt{l-8St+i)>^.  (11) 

Since  1  —  SSt+i  >  0,  we  then  have  yt  >  0.  Since  St  >  1/S  and  yt  >  0,  by  Lemma  1,  we  have 
St+i  <  1-  In  that  case,  we  have  5f+2  <  1/^,  for  otherwise  we  would  have  a  disagreement 
regime  at  i  +  1,  rendering  St+i  =  8St+2  >  1-  Hence,  we  have  an  agreement  regime  at  t  +  1, 
and  thus 

5t+i  =  l+yt+i(l-<55,+2)<l,  ■  (12) 

yielding  yt+i  <  0.  By  combining  (11)  and  (12),  and  writing  A  =  yt  —  yt+i,  we  obtain 

1  0 

-^<St  =  l  +  yt~Syt[l  +  yt)  +  SytA  +  6^ytyt+iSt+2-  (13) 

Since  ytyt+iSt+2  <  0,  this  gives  us  8ytA  >  8ytA  +  6'^ytyt+\St+2  >  ^  -  I  -  yt  +  6yt{l  +  yt)- 
Hence, 


Once  can  check  that  the  expression  on  the  right  hand  side  is  minimized  at  y*  =  -j^Jj  —  1, 


taking  the  value  of  2w|  (^  —  l)  —  (^  —  l),  which  is  greater  than  \/8  -  1.    ■ 

By  Lemma  2,  if  we  have  a  disagreement  regime  at  i  —  1  while  we  have  an  agreement  regime 
at  t,  we  must  have  yt  —  yt+i  >  (1  —  ^)  /^-  Put  it  differently,  if  yt  —  yt+i  <  {1—6)  /6  and  we 
have  an  agreement  regime  at  i  <  i  —  1,  we  must  also  have  an  agreement  regime  at  t  —  1.  We 
already  know  that  we  have  an  agreement  regime  at  some  i  with  t  —  2  —  L{6)  <  i  <  t  —  I. 
Hence,  if  y  is  smooth  enough  so  that  yt  —  yt+i  <  (1  —  5)  /5  at  each  i  G  T,  we  would  have  an 
agreement  regime  at  each  t  <  i: 

Theorem  2  Assume  TU,  NL,  and  that  yt  —  yt+\  <  (1  —  6)  /6  at  each  t  £  T.    Then,  we  have 
an  agreement  regime  at  each  t  £T  with  t  <i  —  L{6)  —  2. 

Excluding  the  end  of  the  game,  Theorem  1  stated  that,  if  players  are  optimistic  throughout 
the  game,  they  will  reach  an  agreement  immediately.  Proposition  2  establishes  that,  actually, 
we  do  not  need  optimism  stay  high;  all  we  need  is  that  it  does  not  drop  too  fast.  In  fact,  by 
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requiring  perpetual  optimism,  Theorem  1  also  bounded  the  drop  in  y:  iiyt,yt+i  6  [0,1],  then 
yt  —  yt+i  <  1-  Of  course,  this  bound  is  much  looser  than  that  of  Theorem  2;  and  these  two 
results  are  logically  independent.  We  may  have  a  disagreement  regime  just  before  the  end  of 
the  game,  precisely  because  after  the  game  ends  yt  is  identically  —1,  and  thus  we  may  have  a 
substantial  drop  in  y  at  the  end  of  the  game.  Theorem  2  also  implies  that  if  the  transition  to 
—  1  is  smooth  enough,  there  will  be  no  disagreement  regime  at  the  end  of  the  game,  either. 

We  will  present  our  Agreement  Theorem  for  the  general  case  in  Section  5.  Beforehand,  we 
present  some  prehminary  results. 

4     Equilibrium  and  the  Rents  —  General  Ccise 

In  this  section,  we  will  describe  the  subgame-perfect  equilibria  of  our  game.  In  the  finite- 
horizon  case,  all  the  equilibria  will  be  payoff  equivalent  to  each  other.  In  the  infinite  horizon 
case,  we  will  only  consider  the  limit  of  the  equilibria  of  the  finite-horizon  truncations.  In 
equilibrium,  at  each  date,  the  recognized  player  extracts  a  non- informational  rent,  and  these 
rents  determine  players'  equilibrium-payoffs. 

Towards  describing  the  subgame  perfect  equilibria,  let  us  take  any  finite-horizon  game 
(7*  [6,p],  and  write  Vf^  [t\  (pj  for  the  equihbrium  continuation- value  of  a  player  i  at  any  t  with 
history  P(.  The  stochastic  process  V  [t\  is  defined  by  the  recursive  equation 

V/  [t\  iPt)  =  P'  [p,  =  i\Pt)  m\bVt+i  [t\  {p„ i))  +  P'  {p,  =  j\p^)  6V,\,  [t\  {p„j)         (15) 

and  the  boundary  condition 

Vi[i\=0,  (16) 

where  m'  :  K^  — ►  R+  is  defined  through  m^(u)  =  max  [{v^}  U  {u'|  (u^,u^)  G  U,  u^  >  v^}\  at 
each  i;  G  K^  for  i  7^  j.  Note  that  rn}{v)  is  the  maximum  payoff  player  i  can  enjoy  if  he  needs 
the  consent  of  player  j  for  an  agreement  and  if  the  payoff  vector  v  would  be  realized  in  case  of 
a  disagreement.  (Of  course,  he  wiU  choose  to  disagree  if  it  is  too  costly  to  convince  the  other 
player,  i.e.,  when  v  0  U.)  By  Assumption  FDA,  when  i  gets  m^{v),  j  gets  vK  Hence,  (15) 
expresses  that,  with  probability  P^  {p^  =  i|p(),  player  i  wiU  be  recognized,  in  which  case  he 
will  get  the  maximum  payoff  given  that  he  needs  to  give  at  least  5V(-^j  [i]  {Pf,i)  to  player  j  in 
order  to  reach  agreement,  and  with  probabihty  P'  (/9(  =  j|pt),  j  will  be  recognized,  in  which 
case  i  will  get  his  continuation  value  in  the  case  of  a  delay.  Condition  (16)  simply  states  the 
fact  that  players  automatically  get  0  at  t. 

The  equihbrium  strategy  s,  [t]  of  a  player  i  [^  j)  will  be  as  follows: 

Given  any  t  with  pj,  if  i  is  recognized  at  t,   and  if  6Vt+\^{p^,i)    G   U,  offer 
to  give  ^V/^_j  [F|  (P(,t)  to  j  and  m}{6Vt+\  [i]  {Pt^'^))  to  i;  if  i  is  recognized,  but 
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^Vj+i  [t\  {Pf,i)  ^  U,  then  offer  to  give  0  to  j  and  m'  (0)  to  z;  if  j  is  recognized, 
accept  his  offer  iff  he  gives  i  at  least  SV^^^  [t\  {pt,j). 

Given  any  infinite-horizon  game  G°°  [6,p],  for  each  t  G  N,  consider  the  finite-horizon  trun- 
cation G*  [5,p],  which  agrees  with  G°°  [6,p]  until  i,  and  ends  there.  Defining  V  [t\  as  above, 
assume  that 

Vt[t\{p,)^Vt[^]ip,)      asf-cx)         (Vp(,Vi)  (17) 

for  some  V  [oo].  We  define  the  strategy  profile  s  [oo]  =  (sj  [oo]  ,S2  [oo])  as  above,  by  taking 
t  =  oo.  The  following  Proposition  states  that,  when  i  is  finite,  s  [f]  =  (si  [t\ ,  S2  [i])  is  essentially 
the  unique  subgame  perfect  equilibrium;  when  <^=  oo,  s  [oo]  is  a  subgame-perfect  equilibrium, 
but  not  necessarily  unique.  The  proof  of  this  Proposition  is  fairly  straight- forward,  and  can 
be  found  in  Yildiz  (2000). 

Proposition  1  Given  any  G°°  {6,p\  with  (17),  s  [oo]  is  a  subgame-perfect  equilibrium.  Given 
any  finite-horizon  game  G*  [6,  p] ,  s  [i]  is  a  subgame-perfect  equilibrium.  Moreover,  for  i  <  oo,  at 
any  subgame-perfect  equilibrium,  and  at  any  date  t  unth  pj,  the  following  are  true:  the  vector  of 
continuation  values  at  the  beginning  oft  is  Vj  (pj);  if  6Vt+i{pt,i)  is  in  the  interior  ofU,  and  i 
is  recognized,  then  they  reach  an  agreement  that  gives  5V^_^_^{p^,i)  to  j  and  m^{6Vt+i  [t\  {pt,i)) 
to  i;  and  if  8Vt+\{pi,i)  ^  U  and  i  is  recognized,  then  they  do  not  reach  an  agreement  at  t. 

In  the  rest  of  the  paper,  we  will  confine  ourself  to  the  equilibrium  s;  we  suppress  the  terms 
in  square  brackets. 

In  equilibrium,  the  recognized  player  extract  a  rent;  and  these  rents  constitutes  the  bar- 
gaining power.  To  see  this,  observe  that,  given  any  pj  and  any  player  i,  the  difference  between 
his  payoff  at  {Pt,i),  when  he  makes  an  offer,  and  his  payoff  at  {Pf,j),  when  he  does  not,  is 
m'(5Vt+i(pt,i))-5ViVi(p„j)  =  [vn}{8Vt+^{p„i))  -  6Vl^,{p„i)\-^[6V^%^{p,,i)  -  6V^\,{p„j)\. 
Here,  the  last  term  is  informational;  that  is,  the  identity  of  the  recognized  player  at  t  affects 
players'  beliefs  about  the  future  recognitions,  which  in  turn  affect  their  continuation  values  in 
equihbrium.  The  first  term, 

B\{p,)  =  m\dVt+^{p^,i))  -  dVi^M^i),  (18) 

on  the  other  hand,  is  caused  by  his  opportunity  to  make  an  offer  that  can  be  rejected  only  by 
delaying  the  agreement  until  t-\-  \\  and  we  call  it  the  (non-informational)  rent  for  i  at  t. 
Substituting  (18)  in  Equation  (15)  and  carrying  out  the  necessary  algebra,  we  obtain 

V^ip,)  =  E%,-,yRl\p,]  +  8E'[V^^,\p,],  ■  (19) 

where  l{p,=i}  is  the  indicator  function  of  {p^  =  i},  taking  values  1  and  0  when  pt  =  i  and 
Pi  7^  i,  respectively.  Equation  (19)  states  that  the  value  of  the  game  for  a  player  i  at  any  date 
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t  is  his  discounted  expected  value  of  the  game  at  i  +  1  plus  the  rent  he  expects  to  extract  from 
being  a  recognized  player  at  t.  Since  the  value  of  the  game  at  i  is  identically  nil,  this  gives 
us  the  following  Proposition,  stating  that,  in  equilibrium,  the  continuation  value  of  a  player 
is  the  present  value  of  all  the  rents  he  expects  to  extract  in  the  future.  In  an  agreement,  the 
recognized  player  gives  the  other  player  his  continuation  value,  keeping  the  rest  of  the  pie  for 
himself.  Therefore,  a  player's  share  in  an  agreement  is  itself  the  discounted  sum  of  the  rents 
he  expects  to  extract  in  the  future,  plus  the  current  rent  in  case  he  is  the  recognized  player. 

Proposition  2   Given  any  i  G  A'^,  t  Q  T  and  any  pi,  we  have 

i-l 

V^Pt)  =  J2^'~'^'  [l{p.=,}^slPt]  •  (20) 

s=t 

Using  (16),  (19),  and  the  law  of  iterated  expectations,  one  can  easily  check  via  mathematical 
induction  that  (20)  holds  for  any  finite  i.  Using  the  fact  that  R  is  uniformly  bounded,  Yildiz 
(2000)  shows  that  it  holds  for  the  infinite-horizon  case  as  well. 

Proposition  2  states  that  the  continuation  value  of  a  player  is  the  present  value  of  the  rents 
he  expects  to  extract  in  the  future.  Since  the  rents  themselves  depend  on  players'  beliefs  as 
well  as  the  history,  it  is  not  clear  that  a  player  would  not  lose  if  each  player  comes  to  believe 
that  he  is  more  likely  to  make  offers  in  the  future.  In  fact,  in  Appendix  A  we  present  an 
example  where  a  player  loses  as  he  finds  himself  more  likely  to  make  an  offer  at  i  =  1,  while 
everything  else  remains  unchanged.  Nevertheless,  there  we  also  show  that  a  player  will  not  lose 
when  each  player  comes  to  believe  that  he  is  more  likely  to  make  offers  in  the  future,  so  long 
as  the  recognition  process  is  affiliated,  e.g.,  it  is  exchangeable  or  independently  distributed  - 
the  cases  that  we  mainly  focus.  In  that  case,  our  definition  of  optimism  will  be  unambiguously 
valid. 

According  to  s,  players  reach  an  agreement  at  any  {pt,i)  iff  SVt+i{pt,i)  G  U.  In  that  case, 
we  will  say  that  we  have  an  agreement  regime  at  {pf,i),  and  a  disagreement  regime,  otherwise. 

5      Agreement  Theorem 

Assuming  that  players  do  not  change  their  beliefs  as  they  observe  who  gets  a  chance  to  make 
an  offer  and  when  (NL),  for  the  case  of  transferrable  utility  (TU),  Theorem  1  estabhshed  that, 
if  the  level  of  optimism  y  stays  non-negative  for  a  sufficiently  long  future,  players  will  reach 
an  agreement  immediately.  In  this  section,  we  will  present  an  extension  of  this  result.  Our 
extension  will  state  that,  if  the  level  of  optimism  y  stays  sufficiently  high  for  a  sufficiently 
long  future,  players  will  reach  an  agreement  immediately.  While  Assumption  TU  is  dropped 
entirely,  it  will  be  clear  that  a  weaker  form  of  NL  remains  embedded  in  the  assumption  that  y 
stays  sufficiently  high  for  a  suiBciently  long  future. 
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We  first  present  the  result  for  the  finite  horizon  case. 

Theorem  3  Given  any  6  G  (0, 1)  and  any  non-negative  integer  t*  G  N,  there  exists  some 
y*  e  (  —  1, 1)  such  that,  for  every  finite-horizon  game  G*[6,p]  vjith  t  >  t*  +  L{6)  +  3  and  with 
yt  >  y*  it  each  t  Q.T,  we  have  an  agreement  regime  at  every  t  <  t* . 

Theorem  3  extends  Theorem  1  in  a  weaker  form,  by  dropping  Assumptions  TU  and  NL. 
Intuitively,  if  players  know  that  their  opponents  will  remain  sufficiently  optimistic,  they  will 
adjust  their  expectations  down  so  that  they  will  reach  an  agreement  immediately,  except  for 
the  end  of  the  game,  where  we  may  have  a  period  of  disagreement  regimes  due  to  a  large 
last-mover-advantage.  Under  Assumptions  NL  and  TU,  Theorem  1  required  only  that  players 
do  not  lose  their  optimism,  hence  y*  =  0  was  sufficient  for  immediate  agreement.  In  Example 
3  we  will  establish  that  y*  may  need  to  be  strictly  positive  with  risk-averse  players. 

The  proof  of  this  Theorem  is  similar  to  that  of  Theorem  1,  but  less  transparent,  and 
therefore  relegated  to  Appendix  B.  The  proof  consists  of  the  following  steps.  Firstly,  we 
show  that,  under  NL,  the  lengths  of  disagreement  periods  are  uniformly  bounded.  Then,  we 
show  that  our  result  holds  with  strict  inequalities  for  the  case  when  y  is  identically  I.  Finally, 
invoking  a  continuity  property  of  equilibrium  payoffs  with  respect  to  the  belief  structures,  we 
obtain  our  result. 

We  extend  Theorem  3  to  the  infinite-horizon  case  as  follows. 

Theorem  4   Given  any  6  6  (0, 1)  and  any  non-negative  integer  t*  G  N,  assume  that 

Vt  [i,p]  -^  Vt  [oo,p]      asi-*oo  (Vt  <  t*)  (21) 

uniformly  over  belief  structures  p,  where  i  and  p  indicates  the  underlying  game.  Then,  there 
exist  some  i  G  N  and  y*  G  (  —  1, 1)  such  that,  given  any  G'^[6,p]  with  yt  >  y*  at  each  t  <  t,  we 
have  an  agreement  regime  at  every  t  <t*. 

Assuming  that  equilibrium  continuation-values  converge  uniformly  over  belief  structures. 
Theorem  4  extends  Theorem  3  to  infinite-horizon  games,  stating  that,  if  the  level  of  optimism 
stays  sufficiently  high  for  a  sufficiently  long  future,  players  will  reach  an  agreement  immediately. 
Intuitively,  under  (21),  the  infinite  horizon  can  be  approximated  by  a  finite  horizon,  whence 
a  version  of  Theorem  3  (with  strict  inequalities)  would  give  us  immediate  agreement  for  the 
infinite  horizon  case. 

Here,  a  finite  period  of  a  sufficiently  high  level  of  optimism  suffices  for  an  immediate 
agreement.  By  requiring  the  optimism  stay  high  only  for  a  finite  time,  we  allow  players'  beliefs 
to  merge,  a  plausible  property  exhibited  by  many  learning  models,  such  as  the  one  we  will 
employ  in  Section  6. 
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While  Assumption  TU  is  dropped  entirely,  a  weaker  form  of  NL  remains  embedded  in 
Theorems  3  and  4.  By  requiring  that  yt  >y*,  our  Theorem  also  requires  that  p]  >  y*  for  each 
i,  as  p\  =  yt  +  I  —  pI  >  yt-  Since  pj  <  1,  this  further  requires  that  |pj  (pj  —  p\  ip't)\  <  1  —  y* 
for  any  two  histories  p^  and  p[,  bounding  how  much  can  be  learned  before  t. 

5.1     Discussion 

If  a  player  i  believes  that  his  opponent  j  is  very  optimistic  and  will  remain  so,  he  will  lower 
his  expectations.  For  such  an  optimistic  opponent  will  not  settle  for  httle,  even  if  i  thinks  that 
the  value  of  continuing  on  bargaining  for  j  is  even  lower.  If  the  level  of  optimism  will  remain 
sufficiently  high  for  a  sufficiently  long  future,  then  players'  expectation  will  be  so  low  that  they 
will  reach  an  agreement  immediately. 

If  the  level  of  optimism  stays  very  high  for  a  long  while,  it  must  be  the  case  that  players' 
conditional  beliefs  do  not  vary  much  with  respect  to  what  happens  so  far.  Since  they  are 
Bayesian,  this  means  that  they  adhere  to  their  initial  beUefs,  and  thus  their  prior  beliefs  are 
very  firm.  On  the  other  hand,  if  players'  prior  beliefs  reflect  a  high  level  of  optimism  for  a  long 
future  and  their  prior  beliefs  are  firm,  then  the  level  of  optimism  will  remain  high.  Therefore, 
we  can  restate  our  theorem  as  follows.  If  players '  prior  beliefs  reflect  a  sufficiently  high  level 
of  optimism  for  a  sufficiently  long  immediate- future,  and  if  these  beliefs  are  sufficiently  firm, 
then  they  will  reach  an  agreement  immediately.^ 

Our  theorem  asks  for  three  conditions  for  an  immediate  agreement.  Firstly,  there  must  be 
a  sufficiently  long  future.  If  the  game  is  to  end  very  soon,  and  if  players  are  very  optimistic 
about  making  an  offer  at  the  end  of  the  game,  each  may  choose  to  wait  until  the  end,  when  his 
opponent  will  know  that  she  will  get  0  if  they  disagree  -  as  we  demonstrated  in  the  Introduction. 

Second,  they  must  remain  sufficiently  optimistic  for  a  long  while.  For,  even  if  the  game  is 
to  continue  indefinitely,  if  each  player  thinks  that  he  will  never  be  able  to  make  an  offer  after  a 
given  date,  they  will  act  as  if  the  game  ends  there,  hence  they  may  disagree  at  the  beginning, 
as  they  may  do  in  a  short  game.^ 

Finally,  it  asks  players'  initial  beliefs  be  sufficiently  firm,  that  is,  they  should  not  update 
their  beliefs  drastically  with  a  limited  observation.  There  are  three  reasons  for  this  requirement. 
Firstly,  it  is  essential  for  our  theorems  that  the  length  of  a  period  of  disagreement  regimes 
is  uniformly  bounded  under  NL,  as  there  may  be  a  disagreement  period  at  the  end  of  the 
game.  When  players  update  their  behefs  as  they  observe  which  players  are  recognized  and 
when,  whether  they  reach  an  agreement  at  a  given  date  depends  on  which  players  have  been 


For  formal  statements,  see  Yildiz  (2000). 

Since  players  will  wait  for  the  end  only  when  they  are  optimistic  about  the  end,  we  may  only  need  that 
the  level  of  optimism  does  not  drop  suddenly  and  drastically,  as  suggested  by  Theorem  2.  It  remains  an 
open  question  whether  we  can  (qualitatively)  extend  Theorem  2  beyond  Assumptions  TU  and  NL,  obtaining  a 
stronger  theorem. 
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recognized  and  when.  The  following  example  demonstrates  that,  when  the  beliefs  are  updated 
drastically,  the  length  of  such  a  disagreement  period  may  be  unbounded,  causing  (possibly 
long)  delays. 

Example  1  Consider  the  following  belief  structure:  For  each  i  6  N ,  and  for  some  p  >  1/2, 
we  have  p]  (pj  =  p  at  each  ''alternating"  history  pj  =  (/Jq  (u)  , . . .  ,Pf_i  (w))  where  p^  {ui)  ^ 
Pgj^i  [ijj]  at  each  s  <  i  —  1.  On  the  other  hand,  if  a  player  i  has  been  recognized  twice  in  a 
row  before  the  other  agent,  each  player  thinks  that  i  is  "blessed"  so  that  he  will  be  recognized 
with  probability  p,  i.e.,  we  have  P^{pt  =  Ps'  {^)\Pt)  =  P^iPt  =  Ps'  {^)\Pt)  =  p  at  each 
Pt  ~  {Po  i'^)  I  ■  ■  ■  iP(-i  (^))  "^^^^  ■s*  =  niin  {s  <  t  —  l\pg  {ui)  =  p^^j  (w)}  >  0.  Letting  i  =  oo, 
we  will  demonstrate  for  high  values  of  6  that  players  will  never  reach  an  agreement  at  an 
alternating  history.  In  that  case,  given  any  t,  at  the  beginning  of  the  game  each  player  believes 
that  with  positive  probability  they  will  not  reach  an  agreement  before  t. 

Consider  any  alternating  history  p^,  where  i  is  recognized  at  t  —  I.  If  i  is  recognized  at 
t,  too,  then  he  will  be  revealed  to  be  blessed.  In  that  case,  by  Proposition  2,  we  will  have 
Vt+i  [Pt^T)  =  Es>t  '5^"'p(l  -5)=p  and  V,\,  {p„i)  =  1  -  p  for  j  ^  z.  (Note  that  R=l-6.) 
Hence,  i  will  offer  to  take  I  —  6  {I  —  p),  leaving  S  {I  —  p)  to  j,  who  will  accept  the  offer.  Since 
Pt  iPt)  —  P  /^'~  each  k  E  N ,  we  now  have 

VI  [p^)     =    p{\~6{\-p))  +  {l-p)8Vl^,{p„3)  (22) 

.     ■•'  y/(pj     =    p6V^^,{p„j)  +  {l-p)6{\-p), 

where  {pt,j)  is  again  an  alternating  history.  The  subgames  starting  at  pj  and  {Pt,j,i)  o,re 
identical  so  that  Vt{pt)  =  Vt{pt,j,i),  hence,  at  any  alternating  history,  payoffs  depend  only 
on  the  last  player  recognized.  Furthermore,  our  game  is  symmetric  with  respect  to  the  players 
so  that  V^  (pt)  =  y^i  (pt,i)  =  V^  {p,6)  and  V/  (p,)  =  V;Vi  (Pt,j)  =  V^  {p,6).  Substituting 
these  values  into  (22),  we  obtain 


V^(p,6)  \  1 

VP{p,6)  j       l-6^p{l-p) 


1     6{\-p) 
dp  1 


**<' -*"-/».  (23) 


Letting  ^  — >  1,  one  can  check  from.  (23)  that,  for  every  p  >  1/2  there  exists  some  6  sufficiently 
close  to  1  such  that  V^  (p,  5)  +  V^  (p,  <5)  >  1/5,  when  we  will  have  a  disagreement  regime  at 
each  alternating  history. 

Second,  when  players  update  their  behefs  drastically,  we  will  have  another  form  of  the 
deadhne  effect,  as  illustrated  in  the  following  example. 

Example  2  Assuming  TU,  take  any  t*  with  0  <  i*  <  min{L  (6)  ,t].  Consider  the  case  where 
{P(),Pi,  ■  ■  ■  ,Pt')  0.1^6  stochastically  independent  under  both  P^  and  P^,  P^{pf  =  i)  =  1  for  each 
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i  G  N ,  and  pj  =  p^.  at  each  t  >  t* .  If  a  player  i  is  recognized  at  t* ,  the  other  player  j  will  know 
that  he  will  no  longer  be  able  to  make  any  offer,  hence  by  Proposition  2,  we  have  V^._^_^{pf..,i)  = 
0  at  each  p^. .  In  that  case,  we  will  have  m^  {8Vt+i  {pf,i))  =  1  —  8V/_^_^  {pi.,i)  =  1,  yielding 
^t'  {Pf )  =  1  ^y  (15).  Now,  Sf  [pt, )  =  Vjl  (p(. )  + 14?  (pj.  )  =  2  at  each  pj. ,  and  (pg,  pj , . . .  ,  pj. ) 
is  independently  distributed.  As  we  showed  in  subsection  3.4,  under  these  conditions,  they  viill 
not  reach  an  agreement  before  t* .  [Note  that  y^  =  1  and  yt  =  0  thereafter,  consistent  with 
perpetual  optimism.] 

Example  2  also  suggests  the  third  and  the  most  important  rationale  for  requiring  players' 
beliefs  be  firm:  When  a  player  is  known  to  update  his  beliefs  substantially  with  limited  ob- 
servation, his  opponent  may  be  willing  to  wait  and  let  him  observe,  hoping  that  he  will  be 
convinced  that  she  is  right.  For  instance,  in  our  example,  players  wait  until  t* ,  when  their 
opponents  are  expected  to  be  convinced  that  they  will  no  longer  be  able  to  make  any  offer. 

To  illustrate  our  rationale  in  a  greater  detail,  consider  the  canonical  case  of  affiliated  p, 
where  each  player  finds  a  player  i  more  likely  to  be  recognized  at  t  if  they  know  that  she  is 
recognized  at  some  s.  Assume  that  players  are  very  optimistic  about  t  and  that  they  find 
being  recognized  at  s  as  a  strong  evidence  of  being  recognized  at  i.  In  that  case,  they  must 
also  be  very  optimistic  about  s.  Now,  player  i  thinks  that,  if  they  disagree  until  s,  it  is  very 
likely  that  she  will  be  recognized  at  s,  and  hence  her  opponent,  namely  j,  will  be  convinced 
that  i  is  very  likely  to  be  recognized  at  t,  and  will  be  willing  to  agree  on  i's  terms.  Being  very 
optimistic  about  convincing  her  opponent  by  s,  player  i  will  not  settle  at  the  beginning  unless 
she  gets  a  high  share.  But  her  opponent  is  also  very  optimistic  about  convincing  i  by  s,  and 
thus  may  not  be  willing  to  give  i  what  she  wants  for  an  agreement  at  the  beginning.  In  fact,  if 
z's  continuation  value  at  s  increases  substantially  as  she  convinces  j,  and  if  s  is  close  enough, 
then  they  would  choose  to  wait  until  s  -  as  we  demonstrated  in  Example  2.  Oiu:  Theorem 
assumes  that  the  rate  of  learning  is  so  slow  that  either  the  impact  of  convincing  j  is  not  so 
large,  or  the  required  time  to  convince  him  is  too  long  to  wait. 

On  the  other  hand,  in  canonical  learning  models,  unless  their  beliefs  are  sufficiently  firm, 
players  update  their  beliefs  substantially  at  the  beginning,  whence  we  have  disagreement 
regimes  at  the  beginning  of  the  game,  and  thus  a  delay  in  reaching  an  agreement  on  the 
path  of  equilibrium.  Analyzing  such  a  canonical  learning-model,  in  our  next  section  we  will 
show  that  agreement  can  be  delayed,  and  will  derive  a  strict  bound  on  the  settlement  date. 

6     A  stylized  model  with  le2irning 

Without  restricting  how  fast  players  update  their  beliefs,  but  assuming  away  the  deadline 
effects,  we  will  now  explore  when  players  reach  an  agreement  in  a  canonical  case,  characterized 
by  Assumption  TU  and  Assumption  XB  below,  which  we  will  maintain  throughout  this  section. 
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Under  these  two  assumptions,  we  will  show  for  f  =  oo  that  there  exists  a  predetermined 
date  such  that  players  will  never  reach  an  agreement  before  that  date  and  will  always  agree 
thereafter.  Throughout  this  section,  we  will  take  T  =  N,  i.e.,  i  =  oo.  (Our  results  are  also 
valid  for  any  sufficiently  large  i;  see  Yildiz,  2000). 

Assumption  XB  Given  any  t,s,r  G  N  with  r  <  t  <  s,  and  any  history  p[  of  recognitions 
where  player  1  is  recognized  r  times  out  of  t,  for  each  i  &  N,  v/e  have 

^■'^-  =  ■'"''  =  T^  P^' 

for  some  kj,h  &  N  with  k^  <  h. 

This  assumption  holds  when  each  player  believes  that  p  is  exchangeable,  i.e.,  it  is  identically 
and  independently  distributed  with  some  unknown  parameter  ^  measuring  the  probability  of 
{Pt  =  1},  and  p  is  distributed  with  a  beta  distribution  such  that  as  if  each  i  had  a  uniform 
distribution  on  [0,1]  for  p  and  observed  h  (pre- bargaining)  trials  at  an  interim  stage  where 
player  1  was  recognized  k^  times  (see  Fudenberg  and  Levine,  1998).  Of  course,  each  player 
believes  that  his  own  h  trials  are  relevant  for  this  bargaining.  We  will  write  K  =  k\  —  k2- 
While  h  measures  the  level  of  conviction  in  players'  prior  beliefs,  K/  [h  +  2)  will  be  shown  to 
measure  the  initial  level  of  optimism. 

Note  that,  since  i  believes  that  p  is  identically  distributed,  his  beliefs  about  p^  does  not 
depend  on  s.  Substituting  (24)  in  definition  of  y,  we  obtain  that 

at  each  pj  e  A^'  and  s  >  t.  Two  properties  of  y  are  important  for  us.  Firstly,  since  players' 
beliefs  about  s  do  not  depend  on  s,  the  discrepancy  in  beliefs  about  s  does  not  depend  on  s, 
either.  It  depends  only  on  the  date  the  behefs  are  held,  i.e.,  ys{Pt)  =  VtiPt)  for  each  s  >  t. 
Therefore,  we  denote  it  by  y^^j  -  indexed  only  by  the  date  the  behefs  are  held.  Note  that  j/(f) 
measures  the  discrepancy  in  players'  behefs  about  any  recognition  in  future  at  the  beginning 
of  date  t,  while  yt  denotes  the  entire  process  of  the  discrepancy  in  behefs  about  the  recognition 
at  t,  whose  value  at  the  beginning  of  t  is  yuy  When  ki  <  k2,  we  have  j/(j)  <  0  throughout  so 
that  we  have  agreement  regimes  throughout  the  game  as  in  the  Rubinstein-Stahl  framework. 
We  will  assume  that  ki  >  k2  (i.e.,  K  >  0),  rendering  y(()  >  0  throughout. 

Second,  yt  is  deterministic,  i.e.,  yt  {Pg)  =  yt  (p's)  for  each  Ps,  p's  €  N^  with  s  <  t.  This  is 
due  to  our  assumption  that  h  is  same  for  both  players.  This  renders  R  and  S  also  deterministic 
-  even  though  V  is  not  deterministic.  (Recall  that  St  =  V/  +  VJ^  and  B\{pt)  =  max{0, 1  — 
6St+i{Pt,i)}.) 

Lemma  3  Under  Assumptions  TU  and  XB,  given  any  t  E  T  =  N  and  i  £  N,  St  and  R\  are 
deterministic,  and  hence  R^  =  B^  =  Rt  for  some  Rt  &  K. 
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When  5(+i  is  deterministic,  Rl  is  deterministic  and  B\  =  B^.  In  that  case,  Si  will  also  be 
deterministic,  giving  us  our  Lemma.  Since  St+\  and  6  determine  whether  we  have  an  agreement 
regime  at  i  G  T,  the  event  that  players  reach  an  agreement  at  t  is  also  deterministic. 

Writing 


Kt  =  Y.^'-'Rs  (26) 


s=t 

for  the  present  value  of  all  future  rents,  we  express  V  and  S  in  terms  of  A: 

Lemma  4  Assume  TU  and  XB.  Given  any  p^  e  N^  and  any  i  &  N,  we  have 

Vt\pt)     =    p\{Pt)Kt  (27) 

StiPt)    =   _(l  +  y(t))At.  (28) 

The  first  statement,  (27),  is  a  simple  application  of  Proposition  2,  Lemma  3,  and  the  fact 
that  pI  (pj)  =  pI  (pj)  for  each  s  >t.  Adding  up  (27)  over  the  players,  we  obtain  (28).  Since  pj 
is  stochastic,  so  is  Vj'.    On  the  other  hand,  both  j/(t)  and  Aj  are  deterministic,  and  so  is  St- 

Using  (28),  we  can  decouple  (26),  and  obtain  a  difference  equation  for  A.  Consider  any  t 
with  an  agreement  regime.  Now,  Rt  =  1  —  6St+i,  hence  (28)  yields  Rt  =  1  —  6{1  +y(t+i))^t+i- 
Moreover,  by  (26),  we  have  At  =  Rt  +  ^Aj+i.  Combining  these  two  equations,  we  obtain 

At  =  !-%+!)  At+i-  (29) 

When  we  have  a  disagreement  regime  at  t,  we  clearly  have  At  =  SAt+i.  These  difference 
equations  imply  that  A  is  uniformly  bounded  by  1. 

By  (28),  we   have  an  agreement  regime  at  any  i  —  1  G  T  iff 

At<— -^ ,  =  D{t).  (30) 

^(i  +  y(t)) 

On  the  right-hand  side,  as  t  gets  larger,  y(^t)  converges  to  zero  so  that  D  (t)  approaches  to  1/6. 
On  the  left-hand  side,  A  is  uniformly  bounded  by  1,  which  is  less  than  1/S.  Thus,  inequality 
(30)  holds  for  every  sufficiently  large  t,  yielding  an  agreement  regime  at  i  —  1.  For  instance, 
whenever  t  >  to  =  jzrs-^  —  h  —  2,  we  have  j/(j)  <  |  —  1,  yielding  D  (t)  >  1,  and  thereby 
rendering  an  agreement  regime  at  i  —  1. 
Let  us  write 

PA  =  {te  T\As  <  D  (s)  Vs  >  t} 

for  the  interval  of  perpetual  agreement,  consisting  of  the  dates  t  such  that  we  have  an  agreement 
regime  at  each  s  >  t.  Since  to  e  PA,  PA  C  N  is  non-empty,  and  thus  possesses  some  minimum 
t**  <  to. 
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We  will  now  derive  a  stricter  upper-bound  for  t** ,  the  date  the  period  of  perpetual  agree- 
ment starts.  Writing 

B{t)=         ^ 


1  +  ^y(t) 

and  using  (29)  repeatedly,  we  first  obtain  the  following  bounds  for  Af. 

Lemma  5  Let  i  =  oo,  6  e  (0,1),  and  assume  TU  and  XB.  Then,  B  {i)  <  h.t  <  B  {t  +  1)  at 
each  t  >t**. 

We  plot  our  bounds  in  Figure  2.  B{t)  converges  to  1  as  i  — >  oo.  Hence,  for  sufficiently 
large  t,  B  {t)  and  B  {t  +  1)  are  arbitrarily  close  to  each  other,  providing  very  precise  bounds 
for  A.  As  shown  in  the  Figure,  our  bounds  are  valid  only  when  t  >  t**,  because  the  recursive 
equation  (29)  holds  only  at  agreement  regimes. 

Let  us  compare  our  bounds  with  D,  the  process  that  determines  whether  we  have  an 
agreement  at  any  given  date.  Clearly,  B  {t)  <  D  {t)  at  each  t  so  that  our  lower  bound  stays  in 
agreement  regime  throughout.  In  comparing  our  upper  bound  B{t  +  1)  with  D{t),  we  write 

Since  y^t)  -  y(,+i)  =  (i_^;,_^2J(t+fe+3) '  *^*^  implies  that  B{t  +  l)  <  D  [t)  iff 


_viiM. 
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t>tcut  = ^ h-2.  (32) 

Consequently,  when  t  >  tcut,  inequality  At  <  B{t  +  \)  <  D  {t)  holds,  yielding  an  agreement 
regime  at  i  —  1.^  This  gives  us  our  bound  for  t**: 

0<<**  <max{0,  [Wj}-  (33) 

We  exhibit  some  values  of  t**  and  tcut  ior  h  =  K  =  I  aX  Table  1.  When  6  is  sufficiently 
small,^  we  have  agreement  regimes  throughout,  in  particular  at  the  beginning  of  the  game. 
As  d  increases,  some  disagreement  becomes  possible;  in  fact,  the  perpetual  agreement  may  be 
delayed  for  a  long  while. 

It  turns  out  that  there  cannot  be  any  agreement  regime  before  f**,  the  date  the  period  of 
perpetual  agreement  starts.  This  gives  us  our  main  Theorem  in  this  section. 


Note  that  this  is  precisely  the  region  where  y^t)  —  2/(t+i)  ^  (^  ~  ^)  /^t  where  we  also  have  j/(s)  —  y^s  +  l)  < 
(1  —  S)  /6  for  each  s  >  t.  This  is  analogous  to  Theorem  2,  which  implies  under  NL  and  TU  that  we  will  have 
an  agreement  regime  at  t  —  1  whenever  y,  —  ys+i  <  (1  —  6)  /i5  for  each  s  >  t. 

'For  instance,  when  6  <     .     j'^2_    -1-1        ,  we  have  [tcutj  <  0  so  that  t"  =  0. 
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Figure  2:  Functions  D,  B,  B{t  +  1),  and  A.   A*  denotes  the  solution  to  (29),  which  is  vahd 
only  on  PA.  Note  that  tcut  =  6.4624.  [6  =  0.99,  h  =  K  =l,t  =  500.] 
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Table  1:  tcut  and  t**  vs.  6.  {h  =  K  =  I,  t  is  very  large.) 
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Theorem  5  Let  i  =  oo,  b  G  (0,1),  and  assume  TU  and  XB.  Then,  there  exists  at**  G  T 
such  that,  at  each  t  >  t** ,  players  reach  an  agreement  immediately  if  they  have  not  reached  an 
agreement  yet,  and  they  do  not  reach  an  agreement  before  t** . 

Proof.  By  definition,  we  have  an  agreement  regime  at  each  t  >  t**  =  ra\nPA,  hence 
we  will  only  show  that  we  have  a  disagreement  regime  at  each  t  <  t**.  If  t**  =  0,  this  is 
vacuously  true,  so,  we  assume  that  t**  >  0.  In  that  case,  i**  —  1  <  tcut  by  (33);  and  we  have  a 
disagreement  regime  at  t**  —  1  by  definition  of  t**.  Now  we  will  show  that,  whenever  we  have 
a  disagreement  regime  at  any  t  <  tcut,  we  will  have  a  disagreement  regime  at  t  —  1,  showing 
by  mathematical  induction  that  we  have  a  disagreement  regime  at  each  s  <  t**  —  1. 

To  this  end,  take  some  t  <  tcut  with  a  disagreement  regime  so  that  St+i  >  1/6.  We  have 
Rt  =  0,  and  hence  Aj  =  6At+i-  By  (28),  this  yields 

i  +  y{t+\) 

Hence,  St  >  St+i  whenever  l  +  y^t+i)  <  ^(1  +  2/(1)),  i-e-,  whenever  y(t)  -  •|y((+i)  >  ^.  But  this 
is  always  true:  y{t+i)  >  0  and  t  <  tcut,  hence  i/(f)  -  ^y{t+i)  >  y(t)  -  y(t+\)  >  ^-  Therefore, 
we  have  St  >  St+i  >  1/6,  and  hence  a  disagreement  regime  at  t  —  1.    ■ 

For  a  very  canonical  case,  Theorem  5  states  that,  unless  players'  initial  beliefs  are  so  firm 
that  our  Agreement  Theorems  apply,  reaching  an  agreement  will  be  delayed  for  a  while.  For, 
typically,  at  the  beginning  of  a  learning  process  players  are  more  open  to  new  information,  in 
the  sense  that  they  update  their  beliefs  substantially  as  they  observe  which  player  gets  a  chance 
to  make  an  off'er.  Knowing  this,  each  player  waits,  believing  that  the  events  are  very  likely  to 
proceed  in  such  a  way  that  his  opponent  will  change  his  mind.  As  time  passes,  they  become 
experienced.  In  this  way,  two  things  occur  simultaneously,  both  facihtating  agreement.  Firstly, 
having  similar  experiences,  the  discrepancy  in  their  beliefs  diminishes.  Secondly,  they  become 
so  closed  minded  that  their  opponents  lose  their  hope  to  convince  them  and  thus  become  more 
wilUng  to  agree  in  their  terms.  Therefore,  after  a  while,  they  reach  an  agreement. 

At  the  beginning  of  our  game  it  is  common  knowledge  that  they  will  not  reach  an  agreement 
until  t** ,  when  they  will  reach  an  agreement  no  matter  what  happens  by  then.  How  they  will 
share  the  pie  at  t**  will  depend  on  how  many  times  each  player  will  have  been  recognized. 
Since  they  disagree  about  how  many  times  each  player  is  hkely  to  be  recognized  by  t**,  there  is 
no  consensus  among  our  players  on  how  they  can  better  each  of  them  by  agreeing  on  a  decision 
at  the  beginning.  Therefore,  they  wait  imtil  t**  even  though  there  is  a  consensus  among  them 
that  there  is  an  agreement  at  the  beginning  that  would  leave  each  player  better  off. 

How  long  can  they  delay  the  agreement?  We  showed  that  the  delay  can  be  at  most 
tcut{h,K,6).  Now,  given  any  K  and  6,  as  /i  — >  oo,  tcut{h,K,6)  — >  — oo,  so  that  t**  =  0 
for  sufficiently  large  values  of  h.   There  are  two  factors  behind  this  limit.  Firstly,  as  /i  — >  oo, 
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players'  initial  beliefs  become  very  firm,  approximating  an  independently  distributed  p  in  the 
limit,  thus  by  Theorem  1,  they  reach  an  agreement  immediately.  But  more  importantly,  the 
discrepancy  in  beliefs  also  disappears  since  y(()  <  y(o)  =  t^/{h  +  2),  converging  to  0  as  /z  — »  oo. 
It  would  be  more  interesting  then  to  set  K  =  yo{h  +  2)  for  some  yo  G  (0, 1]  so  that  the  initial 
level  of  optimism  is  always  yo-  In  that  case,  for  i  >  0,  we  will  have  y(j)  =  yo/{-Ar[  +  1)  ^*  yo 
as  h  —^  oo.  Now,  tcut  =  \/ah~+b  —  h  —  5/2  where  a  =  y^S/  (1  —  5)  and  b  =  2a  +  1/4.  As  h 
increases,  initially,  tcut  also  increases,  possibly  due  to  the  increasing  level  of  optimism.  After  a 
point,  however,  tcut  starts  decreasing,  and  approaches  to  — oo  in  the  limit.  Once  again,  there 
will  be  no  delay  in  agreement  for  large  values  of  h,  consistent  with  our  Agreement  Theorems. 
Finally,  as  5  ^  1,  t^ut  {h,  K,  6)  -^  oo.  The  intuition  for  this  result  is  clear.  Very  patient  players 
can  wait  very  long  to  find  out  whose  beliefs  are  more  accurate. 

We  will  now  measure  how  long  real-time  delay  we  can  have  in  the  continuous-time  hmit.  We 
measure  index-time  t  in  terms  of  real  time  by  r  {t,  n)  =  t/n,  where  ra  >  0  measures  the  fineness 
of  the  grid.  Now,  discount  rate  is  <5  (n)  =  exp  {—r/n)  where  r  >  0  is  the  real-time  impatience. 
Clearly,  exp(— rr  (<,n))  =  (5*,  and  thus  t  {t,n)  =  Mog(6~').  Therefore,  the  maximum  delay 
in  real  time  is  ^  log  ('5-«-t('^,/c.5)^ 

If  h  and  K  do  not  depend  on  n,  we  can  measure  the  real-time  delay  in  the  continuous  time 
hmit  by  hm5_i  ^  ^og  (s~^'="^-^'^'^^^n.    One  can  check  from  (32)  that,  given  any  h  and  K,  as 

5  — >  1,  5""''="*'  '  '  ^  ^  1  so  that  there  is  no  real-time  delay  in  the  limit.  That  is,  given  any  h 
and  K,  the  delay  in  agreement  can  be  bounded  to  be  arbitrarily  short  by  recognizing  players 
sufficiently  fast. 

It  is  crucial  for  this  result  that  h  and  K  are  taken  to  be  independent  of  n  so  that  the 
discrepancy  in  beliefs  vanishes  arbitrarily  fast  as  n  gets  sufliciently  large,  i.e.,  the  discrepancy 
at  any  real  time  r  is  K/  (rn  -\-  h+2),  converging  to  0  as  n  goes  to  oo. 

In  updating  his  beliefs,  a  player  might  take  into  account  how  fast  the  players  are  recognized. 
In  particular,  one  might  attribute  the  recognition  of  a  player  at  a  given  time  to  the  player's 
innate  abilities  little  if  players  are  to  be  recognized  very  frequently.  Therefore,  in  order  to 
measure  the  effect  of  the  frequent  recognition  purely,  one  may  want  to  adjust  h  and  K  accord- 
ingly so  that  the  discrepancy  in  beliefs  at  a  given  real-time  does  not  depend  on  the  fineness  n 
of  the  grid  much.  To  do  this,  let  us  take  h  =  n  and  K  =  yo{h  +  2)  so  that  the  discrepancy 
in  behefs  at  a  given  real  time  t  is  yo/{^^T  +  1),  which  is  approximately  yo/{T  +  1)  for  large 
values  of  n.  As  n  — »  oo,  6{n)  — >  1,  and  h  and  /C  go  to  oo  in  such  a  way  that  the  initial  level 
of  optimism  stays  at  j/o  unchanged.  We  have  already  checked  that,  when  these  two  limits  are 
taken  separately,  in  both  cases  6'*="'  goes  to  1,  rendering  immediate  agreement  in  the  limit. 
When  n  — >  oo,  however,  we  compute  that 

lim  T{tcut{h{n),K{n),8{n)),n)  =  \/yJr  -  1 
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so  that,  in  the  continuous  time  limit,  we  can  have  very  long  real-time  delays  if  the  players  are 
both  patient  (with  low  r)  and  optimistic. 

7     Impact  of  Risk- Aversion 

In  this  section,  reinstating  Assumption  NL,  we  wish  to  demonstrate  the  impact  of  risk  aversion 
on  the  problem  of  reaching  an  agreement.  We  present  a  case  where  players  disagree  at  the 
beginning  of  the  game  even  though  y  >  0  and  the  game  can  be  arbitrarily  long.  That  is,  y* 
of  Theorem  3  needs  to  be  strictly  positive,  and  thus  Theorem  1  could  not  be  extended  beyond 
TU  in  its  full  strength. 

Example  3  Assummg  NL,  take  U  =  {u  e  [0,l]'^\{u^f  +  [u'^)^  <  l},^^  i  >  3,  6  e  (0.907,1), 
and 

p'     =     (1,1,1/2,0,0,...  ,0) 
p2     =     (0,1,1/2,1,1,...  ,1), 

so  that  y  =  (0, 1,0,0, . . .  ,0).    We  will  have  a  disagreement  regime  at  t  =  0. 

Now,  by  Proposition  2,  V3  =  (0, 1).  Hence,  att  =  2,  if  recognized,  players  1  and  2  will  offer 
(vl  —S^,6)  and  (0,1),  respectively.   The  offers  will  be  accepted.   Thus, 

\(   I -^    N       1,^     ,       (\/\-b^    l  +  5\ 


v.  =  l(x/rr7^..)+i(o, 


At  the  beginning  of  date  t  =  1,  player  1  is  certain  that  he  will  be  recognized,  and  will  offer 
1  —  {6V2)  ,^^2^),  which  will  be  accepted.   Thus,  his  continuation  value  will  be 


.2_./.        .2/1+5^' 


Vl  =  ^l-{6ViY  =  ^l-6^^]    .  (34) 

Likewise, 


V,'  =  y/i-mf  =  ^i-S^'-^ 


(35) 


4 

We  have  a  disagreement  regime  att  =  Oif  {V^)  +  [Vi)  >  1/6  .  By  (34)  and  (35),  this  is 
the  case  when2-^  (1  +  ^)  >  ^.  Writing  f  {6)  =  2- ^  (1  +  5) -•^,  we  observe  that  f{l)  =  0, 
/'(I)  =  —1/2  <  0,  thus  f{S)  >  0  whenever  6  is  sufficiently  close  to  1.  In  fact,  one  can  check 
that,  for  any  6  G  (0.907, 1),  we  have  f{5)  >  0,  hence  we  have  a  disagreement  regime  at  t  =  0. 


"This  would  be  the  case  if  players  were  dividing  a  dollar  and  each  had  utility  function  x  1— >  ^/x. 
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In  a  game  with  perpetual  optimism,  each  player  adjusts  his  expectations  down,  knowing 
that  his  opponent  has  very  optimistic  beliefs  -  even  though  he  believes  that  his  opponent  is 
wrong.  With  transferable  utility  this  adjustment  was  enough  for  agreement.  When  players  are 
risk- averse,  however,  uncertainty  about  which  player  will  get  the  rent  lowers  their  continuation 
values  at  the  beginning  of  any  date  with  an  agreement  regime,  increasing  weakly  the  rent  for 
the  player  recognized  at  the  previous  date.  If  in  addition  players  are  highly  optimistic  for  this 
date,  such  an  increase  may  be  high  enough  for  preventing  them  from  agreeing  at  the  date 
before  -  as  our  Example  demonstrates. 

8      Conclusion 

As  suggested  by  Edgeworth  (1881),  with  complete  information,  in  equilibrium,  bargaining 
results  in  an  optimal  outcome.  Given  that  the  delays  are  costly,  this  also  implies  that  the 
agreement  is  reached  immediately.  Yet,  the  agreement  is  delayed  as  a  rule  in  real  life.  A 
prominently  proposed  explanation  for  the  delay  is  parties'  excessive  optimism. 

In  this  paper,  we  analyzed  the  problem  of  reaching  an  agreement  in  a  model  where  players 
are  possibly  optimistic  about  the  recognition  process,  the  ultimate  source  of  bargaining  power 
in  sequential  bargaining  with  no  outside  option.  In  order  to  see  the  pure  impact  of  optimism, 
in  our  analysis  we  relaxed  only  the  common  prior  assumption  of  complete  information,  and 
adhered  to  the  equilibrium. 

We  showed  that  the  excessive  optimism  alone  cannot  be  a  reason  for  the  delay.  When 
players  are  sufficiently  optimistic  for  a  long  while,  recognizing  that  his  opponent  will  remain 
optimistic  for  a  long  while,  each  player  will  lower  his  expectations  about  the  future  so  that  they 
will  reach  an  agreement  immediately.  In  other  words,  presence  of  perpetual  optimism  includes 
also  the  bad  news  for  each  player  that  his  opponent  wiU  remain  optimistic,  inducing  a  form  of 
pessimism  that  moderates  his  optimism  to  the  extent  that  they  settle  at  the  beginning. 

Corroborating  this,  we  further  showed  in  a  special  case  that  we  will  have  an  immediate 
agreement  so  long  as  there  are  no  sudden  jumps  in  the  level  of  optimism,  providing  a  ratio- 
nale for  the  immediate  agreement  result  in  complete-information  models,  where  the  level  of 
optimism  is  constantly  nil.  It  is  not  the  absence  or  the  presence  of  optimism  that  determines 
whether  we  have  an  immediate  agreement;  that  is  determined  by  the  change  in  the  level  of 
optimism. 

Relaxing  the  common  prior  assumption  enriches  our  understanding  further.  Taking  differ- 
ences in  information  to  be  the  only  source  of  differences  in  beliefs,  the  common  prior  assumption 
allows  only  one  notion  of  convincing,  namely  communicating  the  privately  known  information 
in  a  credible  way.  While  the  delays  are  used  as  a  means  to  convince  the  other  party  in  this 
sense,  another  notion  of  convincing  becomes  salient  in  our  model,  providing  a  distinct  rationale 
for  delay.   When  players'  beliefs  are  substantially  different  from  each  other,  a  player  may  be 
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willing  to  wait  so  that  his  opponent  will  observe  more  facts  and  have  a  better  understanding 
of  the  truth,  which  is  presumably  like  our  player  thinks. 

A      Appendix  -  Monotonicity  of  payoffs  with  respect  to  beliefs 

Here  we  will  show  for  affiliated  recognition  processes  that  a  player's  equilibrium  payoff  increases  as  he 
becomes  more  likely  to  make  an  offer  in  the  future.  We  will  first  present  an  example  showing  that  this 
is  not  true  when  the  recognition  process  is  not  affiliated. 

Example  4  Take  U  =  <  (^u^,u^)  6  [0.1]"  1"^  +u^  <  1  f,  i  =  oo,  6  6  (3  —  v^,  1),  and  consider  the 
following  family  of  recognition  processes  p,  parametrized  by  player  1  's  beliefs  about  pj .  Under  P^ ,  p 
IS  independently  distributed  with  P^  [pQ  =  1)  =  0,  P^  (pj  =1)  =  tt,  and  P^  [p^  =  1)  =  1/2  at  each 
s  >  \.  Player  2  's  beliefs  are  similar  to  those  of  player  1,  except  that  player  2  believes  that  he  will  not 
be  recognized  at  t  =  1.  and  if  he  is  recognized  at  t  =  I,  he  will  never  be  recognized  again.  That  is,  given 
any  p^  =  (pg  (w)  ,pi  {uj) ,.. .  ,Pi_i  (oj))  with  t  >  2,  for  any  s  >t,  we  have 

p2(p.  =  2|p,)  =  |  ^/2    ^/^iM  =  i. 

I    U         othenmse; 

and  P^  (po  =  2)  =  1  and  P^  {pi  =  2|po)  =  0.  We  will  show  that  player  1  's  continuation  value  at  the 
beginning  of  date  1  is  decreasing  with  n  =  P^  (pj  =  1).  Hence,  when  it  gets  higher,  player  2  will  offer 
player  1  a  lower  share  at  date  0;  and  the  offer  is  to  be  accepted. 

If  player  1  is  recognized  att  =  1,  we  unll  have  the  Rubinstein- Stahl  framework  with  identical  players, 
hence  he  will  offer  (1  —  6/2,6/2),  which  will  be  accepted.  (The  continuation  values  at  t  =  2  are  1/2. J 
Take  any  Pf  where  player  2  is  recognized  at  date  1.  Now,  Player  2  will  always  think  that  he  will 
never  make  an  offer;  hence,  by  Proposition  2,  Vj'^j  (P(+i)  =  0.  Hence  at  t,  if  recognized,  player 
1  offers  (1,0).  which  is  to  be  accepted.  Thus,  by  (20),  V^  (pj  =  2  '  ^  "*"  2  '  ^^(+1  (P<+i)  ■  Since 
^t  (Pt)  —  ^i+i  {Pt+i) '  ^^  then  have  V^■  (pj)  =  1/(2  —  i5).  Therefore,  if  player  2  is  recognized  at  date 
1,  he  will  offer  [6/  {2  —  6)  ,1  —  6/  {2  —  6)),  which  is  to  be  accepted.  The  continuation  value  of  player  1 
at  the  beginning  of  date  1  is  then 

V/  =  TT  •  (1  -  6/2)  +  (1  -  tt)  •      ^ 


2-6' 


Since  5  6  (3  —  \/5, 1),  6/  {2  —  6)  >  I  —  6/2,  hence  V^  is  decreasing  with  tt.  (Note  that  V^  =  6/2,  hence 
Vl  +V\   <\,  thus  they  reach  an  agreement  at  t  =  0.) 

Proposition  2  stated  that  being  recognized  at  some  t  contains  a  good  news  for  a  player  that  he  will 
extract  some  non-negative  rent.  In  our  example,  for  player  1,  the  recognition  at  date  1  comes  also  with 
the  bad  news  that  his  opponent  is  not  excessively  pessimistic,  hence  now  it  is  common  knowledge  that, 
if  they  wait,  he  will  not  be  able  to  get  more  than  half  the  dollar.  This  bad  news  is  dominant;  and  being 
recognized  at  date  1  is  overall  bad  news  for  player  1.  If  player  1  comes  to  believe  that  he  is  more  likely 
to  receive  such  bad  news,  he  will  have  a  weaker  bargaining  position  at  date  0,  in  which  case  he  will  get 
a  smaller  share  in  the  dollar. 
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Now  we  focus  on  a  canonical  class  of  recognition  processes,  where  all  these  news  are  good.  A 
recognition-process  p  is  said  to  be  (weakly)  affiliated  iff 

P'iPs  =i\Pto  =hPu,Pt^,---  ,PtJ  >  P"  [Ps  =  T-\Pto  ^i,Pu,Pt^,---  ,Pt„) 

for  each  finite  sequence  {to,ti, . . .  ,i„)  of  non-negative  integers,  for  each  i  €  N  and  s  G  N.  Note  that, 
if  p  is  independently  distributed  or  exchangeable,  it  is  weakly  affiliated.  Note  also  that  we  define  the 
affiliation  among  all  the  members  of  a  process,  while  it  is  usually  defined  between  two  random  variables, 
say  Pj  and  p^g,  which  corresponds  to  the  sequence  with  to  =  *i  =  •  •  -  =  *«•  If  a  process  p  is  affiliated, 
then  we  have 

F'  {p,  =  i\i,pt)  >  P'  {p,  =  i\j,p^)  and  P^  {p,  =  j\i,p,)  <  P^  {p,  =  j\j,p,)  (36) 

at  any  two  histories  {i,Pt)  and  {j,Pt)  that  start  with  distinct  players  i  and  j  at  date  0,  but  continues 
with  identical  players. 

Proposition  3    Given  any  two  games  G*  [6,p]  and  G*  [6,p]  with 

P'  i'Ps  =  i\Pt)  >  P'  {Ps  =  Apt)   and  P'  {p,  =  j\p^)  <  P^  [p,  =  jift)  (37) 

at  each  s  and  p^  for  some  distinct  players  i  and  j,  if  p  is  affiliated,  then 

V'  [p]  >  V'  [p]   and  V^  [p]  <  V^  [p] ,  (38) 

where  the  terms  in  [■]  indicates  the  underlying  recognition  process. 

That  is,  when  the  recognition  process  is  affiliated,  if  players'  beliefs  change  in  such  a  way  that  each 
player  finds  a  player  i  at  least  as  likely  to  make  an  offer  as  before,  player  i  (if  anything)  gains  and  his 
opponent  (if  anything)  loses.  A  similar  proposition  for  the  Rubinstein-Stahl  framework  can  be  found 
in  Merlo  and  Wilson  (1995),  where  the  recognition  process  is  affiliated  by  their  modelling  assumptions. 

Proof.  It  suffices  to  prove  the  Proposition  for  finite-horizon  case.  For,  the  continuation  values  in 
an  infinite-horizon  game  is  defined  as  limits  of  the  continuation  values  in  finite-horizon  games;  and  the 
inequalities  will  remain  to  hold  as  we  take  the  limit. 

Assuming  that  f  6  N,  we  will  use  mathematical  induction  on  the  length  t.  Firstly,  for  t  =  0,  we 
simply  have  V^  [p]  =  Vj  [p]  =  Vj  [p]  =  V^  [p]  =  0,  and  our  Proposition  is  trivially  satisfied.  Assume  that 
our  Proposition  holds  for  all  games  of  some  length  i  and  for  every  (n,P^,P^).  Setting  t  =  t  +  \,  take 
any  two  games  G*  [5,p]  and  G*  [6,p\  satisfying  the  hypotheses  of  the  Proposition.  Given  any  history 
hi,o  =  (k,  ui,  Reject)  &  N  xU  x  {Reject},  consider  subgames  G{,  [6,  p]  and  Gf,  [S,  p]  starting  at  hi,o, 
which  are  of  length  i  =t  —  1.  Conditional  on  that  A;  is  recognized  at  0,  these  subgames  trivially  satisfy 

(37)  and  process  {pi,P2,  ■  ■  ■  jPt-i)  is  affiliated  (for  p  is  affihated).  Hence,  by  our  induction  assumption, 

(38)  holds  for  subgames  Gl^^  ^  [5,/9]  and  Gj,j  ^  [S,p],  i.e.,  we  have 

V^  [p]  iPt)  >  VI  [p]  (pj  and  Vi  [p\  (pj  <  K/  \p\  ip,)  (39) 

at  each  t  >  1  and  each  p^.  Thus,  we  only  need  to  show  that  Vq  [p]  >  Vq  [p]  and  Vq  \p]  <  Vq  [p\. 
Recall  that,  by  (15),  we  have  V^  \p]  =  P"  (Po  =  i)  m'{SV,  [p]  {i))  +  P'  (po  =  j)  6V{  [p]  (j)  and  V^  [p]  = 
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P'  (po  =  i)  Tn'(6Vi  [p]  (?;))  +  P'  (po  =  j)  5K/  [p]  (j).  Hence,  we  can  decompose  V'J  [p]  -  V^  \p]  as 

yo[~p]-Vo{p\     =     P'(Po  =  i)[m'{SV,[p](i))-m'{6V,{p]{i))]  _  (40) 

+P'{Po  =  j)[SV^\p]U)-6Vnp]U)]  (41) 

+  [P=  (po  =  z)  -  P^  (po  =  i)]  [m'{5V,  [p]  {i})  -  6V^  [p]  (j)]  .  (42) 

We  will  show  that  all  the  terms  in  [J's  are  non-negative.  Firstly,  note  that  m'  (v)  is  non-decreasing 
with  v'  and  non-increasing  with  v^ .  Hence,  by  (39),  m^{6Vi  \p\  [i))  —  m^{6Vi  [p]  (i))  >  0.  By  (39),  we 
also  have  6Vi  [p]  {j)  —  6Vl  [p]  (j)  >  0;  thus  the  terms  in  lines  (40)  and  (41)  are  non-negative.  To  show 
that  m^(6Vi  [p]  [i))  —  6Vi  [p]  {j)  >  0,  we  need  to  compare  the  subgames  G*  [6,p]  and  G'  [S,p]  starting 
at  histories  (i,w, Reject)  and  (j,u, Reject),  respectively.  Isomorphic  to  each  such  subgame  GJ.  [6,p],  we 
have  a  game  G'~'  ['^'Z''^]  where 

p'{p';  =  i\p,)  =  p'{p^  =  i\k,p,) 

at  each  p,  and  I  £  N.  {p  and  p'^  can  be  defined  on  different  spaces.)  Since  p  is  affiliated,  by  (36),  we  have 
P'  {Pl  =  ^\Pt)  >  P'  {pi  =  ^\Pt)  and  P^  {pi  =  j\pt)  <  PJ  {p^  =  j\pt)  at  each  p,.  Since  p  is  affiliated, 
so  are  p'  and  p' .  Therefore,  by  our  induction  assumption,  we  have  V^  [p']  >  V^  [p'];  in  particular, 
Vj'  [p]  (i)  =  V^  [p']  >  V^  [p']  =  VI  \p]  ij).  Similarly,  we  have  V/  [p]  (i)  <  F/  [p]  (j).  Since  m'  is  non- 
decreasing  in  ti'  and  non-increasing  in  u-',  it  follows  that  m^{6Vi  [p]  (z))  >  m^{SVi  [p]  (j))  >  5Vi  [p]  (j), 
where  the  last  inequality  is  due  to  m'  (v)  >  v.  Therefore,  Tn^{SVi  [p]  (i))  —  6Vi  [p]  {j)  >  0,  and  thus  the 
term  in  line  (42)  is  non-negative,  showing  that  Vq  [p]  -  Vq  [p]  >  0,  and  thus  V  [p]  >  V-  [p].  Similarly, 
we  have  V^  [p]  <  V^  [p];  and  by  induction  hypothesis,  the  proof  is  complete.    ■ 

B      Appendix— Proofs 

Here  we  will  present  the  proofs  that  we  omitted  in  the  text.  We  will  present  these  proofs  under  the 
titles  of  the  sections  they  belong  to.  When  it  is  needed,  we  will  write  the  underlying  parameters 
in  [],  e.g.,  K[i,p],  V[p],  etc.  We  will  also  use  the  supremum  metric  for  the  functions,  i.e.,  |p  —  g|  = 
maxt,j^p^  \pI  (pj)  —  ql  (Ps)|,  and  we  write  p  >  A  €  K  ifTpJ  (p^)  >  A  for  each  {t,i,p^). 

B.l      Agreement  Theorem 

Now  we  will  prove  Theorems  3  and  4.  Towards  this  goal,  we  will  first  develop  our  notion  of  measuring  the 
size  of  the  pie.  Then,  we  will  show  that,  under  NL,  the  lengths  of  disagreement  periods  are  uniformly 
bounded.  Then,  we  will  show  that  our  result  holds  with  strict  inequalities  for  the  case  when  y  is 
identically  1.  Finally,  invoking  a  continuity  property  of  equilibrium  payofTs  with  respect  to  the  belief 
structures,  we  wiU  obtain  a  stronger  version  of  Theorem  3,  which  will  also  imply  Theorem  4. 

Measuring  the  size  of  the  pie  Define  E  :  R\  — ►  IR+,  measuring  the  size  of  each  payoff  vector 
V  >0  relative  to  U,  by  setting 

IU,II 


max{A||t;|||AvG  [/,AeR+} 
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at  each  v  G  IR^\{0}  and  setting  E{0)  =  0,  where  ||-||  is  any  given  norm  on  K^.    Thus  defined,  E  is 
continuous,  increasing,  and  homogenous  of  degree  1,  i.e.,  E(At;)  =  XT,{v)  at  each  A  >  0  and  v  £ 


Moreover,  [/nR^  =  {v|E(u)  S  [0,1]};  and  the  Pareto-frontier  is  defined  by  S(v)  =  1;  in  particular,  we 
have  T,{m^{v),v'^)  =  E(v\m^(u))  =  1.  Given  any  t,  we  write  St  =  T,  (Vj)  for  the  perceived  size  of  the 
pie  at  the  beginning  of  date  t;  we  have  an  agreement  regime  at  any  {Pi,i)  iff  St+i  {Pt,i)  <  1/^-  Given 
any  v,  we  will  write  m  (u)  =  (m'  (v)  ,m}  (v)). 

Disagreement  under  NL  Under  Assumption  NL,  the  length  of  an  interval  of  disagreement 
regimes  is  uniformly  bounded.  For,  if  we  have  a  disagreement  regime  at  any  given  t,  we  have  m{6Vt+i)  = 
6Vt+i,  and  hence  by  equation  (15),  we  have  Vt  =  6Vt+i,  yielding  5t  =  E  {Vt)  =  E  {6Vt+i)  =  6T,  {Vt+i)  = 
dSt+i-  This  holds  at  each  t  with  a  disagreement  regime;  we  must  thus  have  St  =  6  ~  S^  at  each  such 
t,  where  i  is  the  date  of  the  first  agreement  regime  after  t.  Hence,  the  length  of  such  an  interval  of 
disagreement  regimes  is 


L{Si,6)  = 


log(^t) 
\og(l/S) 


1  < 


log(E(m(0))) 


\og(l/6) 


-1  =  L{6)., 


where  the  inequality  is  due  to  the  fact  that  5;  <  E(m(0)). 


Extreme  case     Writing  1  for  the  belief  structure  p  with  pj  =  1  at  each  t  and  i,  we  now  analyze  the 
equilibrium  of  any  finite- horizon  game  G*[6, 1].  [y  is  also  identically  1.) 

Consider  the  end  of  the  game.  Since  Vj  =  0,  by  (15),  we  have  14_i  =  771(0),  and  hence  Si-i  = 
E  {m{0))  >  1.  When  5t_i  €  [1, 1/5],  we  have  an  agreement  regime  at  i  —  2.  If  St-i  >  l/S,  then  we  have 
disagreement  regimes  throughout  the  interval  {i  —  L(6) —  1,. ..  ,i  — 2};  and  we  have  an  agreement 
regime  at  i  —  L{S)  —  2,  where  L{6)  =  L{St-\,6)  as  5j_i  =  S(m(0)).  By  construction,  St_i,^^_i  € 
[1, 1/6].  Together  with  our  next  lemma,  this  will  guarantee  that  we  have  an  agreement  regime  at  each 
t<i-L{6)  -2. 

Lemma  6    Take  any  finite-horizon  game  G*[i5, 1].     Given  any  t  €  T,   if  St+i   €   [1,1/5],   then  St   £ 
[1,2-5]  C[l,l/A). 

Proof.  Since  St+i  <  1/5,  we  have  agreement  regime  at  t;  and  thus,  by  (15),  Vt  =  m{6Vt+i).  Now, 
E  (514+1)  =  5E  {Vt+i)  =  6St+i  S  [5, 1];  therefore,  it  suffices  to  show  that  E(m(u))  €  [1,2  —  5]  whenever 
S  (v)  €  [5, 1].  Given  any  v  €  IR+,  if  E  (v)  =  1,  then  m{v)  =  v,  hence  E  {m{v))  =  E  (v)  =  1  €  [1, 2  -  5]. 

Now  take  any  w  €  1R+  with  5  <  E  (v)  <  1.  By  definition,  m(v)  >  v,  hence  E  (m(v))  >  E  (v)  >  1. 
Hence,  we  only  need  to  show  that  E  {m{v))  <  2  —  5.  We  accomplish  this  using  a  geometric  construction. 
See  Figure  3  for  our  notation  and  graphical  exposition.  In  addition,  we  write  CONE  for  the  convex  huU 
of  n  and  r2.  [Note  that  v,  wi,  Tn{v),  W2  €  CONE.]  We  will  show  that  m{v)  <  C.  Since  m.{v)  €  CONE 
and  E  is  increasing  and  convex,  this  implies  E  (m(w))  <  max  {E  ((2  —  5)wi) ,  E  ((2  —  6)102)}  =2  —  6, 
completing  the  proof. 

To  prove  that  m{v)  <  C,  we  first  note  that,  since  v  <  rn{v),  line  I  has  a  positive  slop  so  that  A, 
B,  C,  V,  and  m{v)  are  all  linearly  ordered.  In  this  order,  clearly,  we  have  A  <  B  <  C.  Moreover,  B 
is  the  center  of  the  rectangle  defined  by  v,  wi,  rn{v),  and  W2,  i.e.,  B  =  v/2  +  m{v)/2  =  Wi/2  +  W2/2. 
Hence,  we  have  v  <  B  <  m,{v).  Furthermore,  we  have  A  <  v.  [For  otherwise,  we  would  have  v  <  A  <  B 
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Figure  3:  An  illustration  for  Lemma  6. 
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so  that  A  £  CONE  and  hence  A  =  \6wi  +  (1  —  X)6w2  for  some  A  €  [0, 1].  Since  6U  is  convex,  we 
would  then  have  A  €  6U,  i.e.,  T,{A)  <  S.  But  v  <  A  also  impHes  that  6  <  T,{v)  <  T,{A);  we  have  a 
contradiction.]  On  the  other  hand,  since  hnes  I,  61,  and  {2  —  b)l  are  parallel  to  each  other,  we  have 

\\B-A\\_        \\w,-bw,\\        _      \l-6\\\w,\\      _^_  ^^g^ 


IIC-BII       \\{2-8)w,-w,\\       12-5-lllKII         • 
Hence, 

\\m{v)  -  B\\  =  \\B  -  v\\  <  \\B  -  A\\  =  \\C  -  B\\ , 

where  the  first  equality  is  due  to  B  =  v/2  +  m{v)/2,  the  second  one  is  due  to  B  >  v  >  A,  and  the  last 
one  is  due  to  (43).  Since  C  >  B  and  since  m{v)  and  C  are  ordered,  this  implies  that  m{v)  <  C.   ■ 

Since  S{_ir^\^  G  [1,  l/6\,  together  with  mathematical  induction,  Lemma  6  gives  us  the  following 
Lemma. 

Lemma  7   Given  any  G*[i5, 1]  urith  i  <  oo,  at  each  t  <  i—  L  (6)  —3,  we  have  St+i  €  [1,  2  —  6]  C  [1,  l/S). 

This  is  Theorem  3  for  the  extreme  case  of  p  =  1  with  strict  inequalities.  On  the  other  hand, 
continuation  values  are  continuous  with  respect  to  the  probability  assessments  p,  as  our  next  Lemma 
states.  Since  we  can  also  bound  p  to  be  close  to  1  by  bounding  y  from  below,  this  gives  us  Theorem  3, 
again  with  strict  inequalities. 

Lemma  8  Given  any  6  €  (0, 1),  any  i  G  N,  t  Q  T,  and  any  e  >  0,  there  exists  some  A  >  0  such  that 
\Vt\p]  (Pt)  -  Vt[q]  (pi)|  <  e  whenever  |p  -  g|  <  A. 

Proof.  We  use  mathematical  induction.  Clearly,  Vj [p]  =  0  is  continuous  in  p.  Take  any  t  €  T  and 
Pj.  By  definition,  we  have  Vt'[p]  (pj  =  pj  (pj  m' {5Vi+i[p]  (p(,i))  +  (l  -pj  (Pt))  ^^<+i[p]  (Pt>j)  at  each 
belief  structure  p,  and  each  i  £  N.  Assume  that  both  p  i— ►  Vi_|_i[p]  (pj,  1)  and  p  >-^  Vj+i[p]  (Pt,2)  are 
continuous.  Since  m^  :  M^  —*  M.  and  the  projection  mappings  p  >—*  p]  (pt)  and  v  i--»  v'^  are  all  continuous, 
P  >— ♦  ^i[p]  (Pt)  must  be  continuous  as  well.  By  induction  hypothesis  this  completes  the  proof.    ■ 

Lemmas  7  and  8  gives  us  Theorem  3  with  strict  inequalities: 

Theorem  6  Given  any  6  €  (0, 1)  and  any  finite  i  >  2,  there  exists  some  y*  <  1  such  that,  for  every 
game  G*[8,p]  with  y  >  y* ,  we  have  St+i  (Pt+i)  <  {2  —  6  +  1/(5) /2  <  1/6  (and  hence  an  agreement 
regime)  at  each  date  t  <t  —  L{6)  —  2>  and  each  Pj+j . 

Proof.  If  t  —  L  (5)  —  3  <  0,  then  our  Theorem  is  vacuously  true;  so,  assume  that  i  —  L  (5)  —  3  >  0. 
Define  Teariy  =  {<  S  T\t  <  f  —  L((5)  —  3},  which  is  finite  and  non-empty.  Given  any  t  €  T^ariy,  by 
Lemma  7,  E(Vt+i[l])  <2  —  6  <  1/6;  and  since  E  is  continuous,  there  exists  some  et  >  0  such  that 
S  {Vt+i\p]  {Pt+i))  <{2-6  +  1/6)  /2  whenever  \Vt+i\p]  (p^+i)  -  Vt+i[l]\  <  £«.  But  by  Lemma  8  there 
also  exists  some  Ap^^^  >  0  such  that  |yt+i[p]  {Pt+i)  —  ^+i[lll  <  (t  whenever  |p—  1|  <  -^p.+j,  i.e., 
pi  (Ps)  >  1  -  ^P,+,  for  each  (s,  p^,  i).  Set  A  =  miup^^^, teT,„wv  ^p,+,  >  0  and  y*  =  1  -  A  <  1.  Take 
any  p  with  y  >  y*.  Now,  given  any  (s,  p^,  ?'),  and  any  P(_,_i  with  t  €  Tgariy,  we  have  pl  (p^)  > 
1  -  P^  (Pj  +  Vs  (ps)  >  Vs  (Ps)  >2/*  =  l-A>l-  Ap,^, ,  where  j  /  i.  Therefore,  at  each  pj+i  with 
t  G  Teariy,  We  havc  \Vt+,\p]  (p,+  i)  -  Vt+i[l]\  <  eu  yielding  E  {V^+M  (p,+i))  <  (2  -  -5  +  1/6)  /2.    ■ 
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Proof.  {  Theorem  4)  Take  any  6  >  0,  and  let  ei  =  1/6  -  (2  -  6  +  1/6)  /2  >  0.  Since  E  is  continuous, 
by  (21),  for  each  t  <  f*,  there  exists  some  fj  e  N  such  that 

|S,+i  [co,p\  -  St+i  [f,p]|  =  \J:(Vt+i  [^,P])  ~^{Vi+i  [f,p])|  <  ei  (Vp)  (44) 

whenever  i  >  iV  This  is  true  in  particular  for  i  >  i  =  max  {t*  +  L  [6)  +  3,maxt<(-  it}-  By  Theorem 
6,  there  also  exists  some  y*  <  1  such  that,  for  every  G''[<5,p]  with  y  >  y*,  we  have  St+i  [i,p]  (pt+i)  ^ 
(2  -  5  +  1/6)  /2  =  1/^  -  €i  at  each  p,_,_i  with  t  <  f.  In  that  case,  by  (44),  we  will  have  St+i  [oo,p]  < 
St+i  [Up]  +  ii  <  1/^,  i-C-,  an  agreement  regime  at  each  t  <  t* .    u 

B.2      A  stylized  model  with  learning 

Now  we  will  present  the  proofs  omitted  in  Section  6.  The  results  in  this  section  are  stated  for  the 
infinite-horizon  case.  Here,  we  will  first  consider  the  finite-horizon  case  and  then  take  the  limit.  We 
write  N  =  (Nu  {oo})\  {0}.  We  will  first  prove  the  following  Lemma. 

Lemma  9  For  any  6  e  {0,1)  and  i  e  N,  assume  TU,  XB  and  that  R\  [t\  =  R'^  [t\  =  Rs  [i\  G  R  for  each 
s  >  t  for  some  t  <  i.  Also,  write  Aj  =  X^^lj  5^~'i?s  [i].  Then,  given  any  p(  S  A''*  and  any  i  €  A'^, 
Vt'  [t\  iPt)  =  P\  (Pt)  M  and  St  [t\  [pt)  =  (1  +  y(i))  A[. 

Proof.  Under  our  hypothesis,  given  any  pj  and  s  >t,  we  have  £'  \l^p^-^^R\  [i]  |pj]  =  Rs  [<]  E''  [l{p,=i}|P(] 
Rs  [t]  P*  {ps  =  i\Pt)  =  Rs  [^  f  [Pt  =  APt)  =  Pt  {pt)Rs  [f|>  where  the  penultimate  equality  is  due  to 
the  fact  that  player's  beliefs  about  different  dates  are  identical.    Hence,  by  Proposition  2,  we  have 
Vl  [<]  {Pi)  =  ^\=t  6'~'E^  [^{P,=^}Rl  [t\  \Pt]  =  Pi  {Pt)K-  This  also  impHes  that  St  [t\  (p,)  =  y/  [i\  (p,)  + 
Vt^  [t\  (Pt)  =  pj  iPt)  Aj  +  P?  iPt)  A[  =  {pI  iPt)  +  P?  iPt))  A[  =  (1  +  Vd))  Aj.    ■ 

Lemmas  3  and  9  imply  Lemma  4  as  an  immediate  corollary.  We  now  prove  Lemma  3. 

Proof.  {Lemma  3)  We  will  prove  the  lemma  for  any  t  €  N.  We  first  consider  the  case  t  €  N 
and  use  mathematical  induction:  Firstly,  R\_-y  [i]  =  Fi^_i  [^1  =  li  which  is  deterministic.  Assume 
that  R]  [*]  =  ii^^  =  -^3  [«]  e  K  for  each  s  >  <  +  1  for  some  t  <  t.  Then,  by  Lemma  9,  St+i  \t\  = 
(l  +y(t+i))  A(^j  G  E  and  hence  R\\t\  =  R^^  =  max{0, 1  —  6St+\  [<]}  €  K,  showing  by  mathematical 
induction  that  R\^  =  iZf  [i]  =  Rt^  for  some  i?t  (^  €  M  for  each  t  <  t.  In  that  case,  by  Lemma  9, 
St  [i]  =  (l  +2/(f))  Aj  €  IR  for  each  t  <t,  proving  our  Lemma  for  finite-horizon  case.  In  the  infinite- 
horizon  case,  as  ^  — ►  oo,  we  will  show  that  Vj  [i]  — »  14  [oo],  and  hence  St  [t]  — »  St  [oo]  and  Rt  [i]  — ►  Rt  [oo] 
at  each  t,  showing  that  St  [oo]  and  Rt  [oo]  are  also  deterministic.    ■ 

To  see  that  Vt  [f]  — >  Vt  [oo],  first  observe  that  A'  =  1  —  6y(t+\)^\+\  and  A'  =  6h\^^  when  we  have 
agreement  and  disagreement  regimes  at  t.  Using  the  fact  that  |5y((+i)|  <  6  <  1,  one  can  check  from 
these  equalities  that  A*  is  strongly  stable  backwards.  Since  A|_j  =  ilt-i  [*]  =  1,  A'  thus  possesses  a 
pointwise  limit  A°°.  Therefore,  Vt  [i]  (pj  — >  Vt  [oo]  (pj)  =  p\  {pt)  A^  everywhere. 

This  also  shows  that  s  [oc]  is  an  equilibrium.  From  now  on,  we  will  take  4  =  oo  and  suppress  i  in 
our  expressions. 

Towards  proving  Lemma  5,  let  us  define  functions  B,  B,  and  C  by  setting  B{t)  =  ^^}      ,  B{t)  = 

B{t  +  1)  and  C{t)  =  ^^^T+I^T^  ^^  ^^^^  *  ^  ^-  ^°^^  *^^t  ^(^)  <  ■^(*)  <  '^(*)  ^*  ^^^^  *•  ^°^  *^^ 
last  inequality,  one  can  check  that 

C{t)  -  m  =         V(.-i)  (1  -  ^y(m))  ^  0 
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We  start  with  the  following  Lemma: 

Lemma  10    Given  any  t  €  PA,  and  6,  c  G  R,  we  have 


At+i     =     B{t  +  l)-b  ^^  At  =  B{t)+6y^t+i)b,  (45) 

At+i     =     C{t  +  l)+c  ^^  At  =  B{t)-6y^t+l)C.  (46) 


Proof.  Since  At  =  1  —  6y(^t+i}At+i,  we  have 

r  1  1  1 

+  5y(t+i)&, 


A(+i  =  -——. b  <=>  At  =  1  -  6y^t+i) 

1  +  oy^i+i) 


1  +  <5j/(t+i) 


1  +  6y{t+i) 


showing  (45).  Likewise, 


y(t+i)  1  +  ^y(t)  y(t+i)  1  +  <5j/(t)  i  +  <5y(t) 


showing  (46).    ■ 

Note  that,  whenever  At+i  >  B{t  +  1),  we  have  Aj+i  =  B{t  +  1)  —  b  for  some  6  <  0,  and  hence  we 
have  At  =  B{t)  +  8y(t+i)b  <  B{t).  Likewise,  whenever  At+i  <  C{t  +  1),  we  have  At+i  =  C{t  +  1)  +  c 
for  some  c  <  0,  and  hence  we  have  At  =  B{t)  —  5j/(t+i)C  >  B{t).  This  gives  us  the  following  Lemma. 

Lemma  11    Given  any  t  e  PA,  if  B{t  +  1)  <  At+i  <  C{t  +  1),  then  B{t)  <  At  <  B{t). 

Lemma  12   For  any  t  €  PA,  B{t)  <  At  <  C{t). 

Proof.  Take  any  t  £  PA,  and  write 

1  iit  =  s 


for  each  s  >  t.  Then,  using  Lemma  10,  we  compute  that 

A,  =  C{t)  +  e*+2'  [At+2i  -  C{t  +  21)]  -      Y^     ^*+2fc  ^^(j  ^  2k)  -  B{t  +  2k)]  .  (47) 


0<fc<i-l 


for  each  t  €  PA,  and  I  >  0.  Using  Lemma  10  and  mathematical  induction  on  I,  one  can  easily  check 
that  (47)  holds. 

Equation  (47)  implies  that  At  <  C{t)  when  /  is  sufficiently  large.  To  see  this,  note  first  that,  since 
|— ^2/(f)|  <  1,  as  Z  — ►  CO,  ^t^^'  — >  0.  Since  we  further  have  |Af+2(  —  C{t  +  2l)\  <  1  at  each  t,l,  it 
follows  that,  as  Z  — »  oo,  ^*"^^'  [At+2i  -  C{t  +  21)]  — »  0.  Second,  ^*+^''  >  0  for  each  k,  as  it  consists  of 
multiplication  of  evenly  many  negative  numbers.  Since  C{t  +  2k)  —  B{t  +  2k)  is  always  positive,  it  follows 
that  X^o<fc<;-i  ^t^  [*-^(*  +  2A;)  —  B{t  +  2fc)]  is  positive,  increasing  in  I,  and  hence  bounded  away  from 
zero.  Therefore,  there  exists  Z'  €  N  such  that 

61+"^'  [At+2i  -  C{t  +  21)]  -      Y,     ^t"^^*"  ["^(^  +  2't)  -  B{t  +  2k)]  <  0  (48) 

o<fc<;-i 
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whenever  /  >  /'. 

On  the  other  hand,  using  (47)  at  t  +  1  and  (46),  one  can  also  obtain 

A,     =     B{t)-6y^,+,)elXf+'[\t+2i+i-C{t  +  2l  +  \)] 

+6y(t+i)     Y.     <^lVi^^' [C(t  + 1 +  2k)-B{t  +  l  +  2k)].  (49) 

0<fc</-l 

Of  course,  by  (48),  there  exists  some  I"  €  N  such  that 

-6y{t+iAtf^'  [A(+2i+i  -  C{t  +  21  +  1)]  +  6y^t+,^      Y.     ^t+f  ^'  [^(*  +  2^'  +  1)  "  ^(*  +  '2k +1)]>0 

0<k<l-l 

(50) 

whenever  I  >  I",  whence  we  have  At  >  B{t).  Therefore,  for  any  I  >  Tnax{l' ,  I" } ,  inequaUties  (48)  and 
(50)  simultaneously  hold,  hence  by  (47)  and  (49),  we  have  B{t)  <  At  <  C{t).    u 

We  can  now  prove  Lemma  5. 

Proof.  {Lemma  5)  When  t  ^  PA,  our  Lemma  is  vacuously  true.  Take  any  t  e  PA.  By  Lemma  12, 
we  have  B{t  +  1)  <  A(+i  <  C{t  +  1),  hence  by  Lemma  11,  we  have  B{t)  <  Aj  <  B{t).    U 
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